TY - JOUR
T1 - Double-Shot 3D Shape Measurement with a Dual-Branch Network for Structured Light Projection Profilometry
AU - Lei, Mingyang
AU - Fan, Jingfan
AU - Shao, Long
AU - Song, Hong
AU - Xiao, Deqiang
AU - Ai, Danni
AU - Fu, Tianyu
AU - Lin, Yucong
AU - Gu, Ying
AU - Yang, Jian
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Structured light (SL)-based three-dimensional (3D) measurement techniques with deep learning have been widely studied to improve measurement efficiency, among which fringe projection profilometry (FPP) and speckle projection profilometry (SPP) are two popular methods. However, these methods generally use a single projection pattern for reconstruction, resulting in fringe order ambiguity or poor reconstruction accuracy. To alleviate these problems, we propose a parallel dual-branch Convolutional Neural Network (CNN)-Transformer network (PDCNet) that takes advantage of convolutional operations and self-attention mechanisms for processing different SL modalities. Within PDCNet, a Transformer branch is used to capture global perception in the fringe images, while a CNN branch is designed to collect local details in the speckle images. To fully integrate these complementary features, we design a double-stream attention aggregation module (DAAM) consisting of a parallel attention subnetwork that aggregates multi-scale spatial structure information. This module dynamically retains local and global representations to the maximum extent. Moreover, an adaptive mixture density head with a bimodal Gaussian distribution is proposed for learning a representation that is precise near discontinuities. Compared with the standard disparity regression strategy, this adaptive mixture head effectively improves performance at object boundaries. Extensive experiments demonstrate that our method reduces fringe order ambiguity while producing high-accuracy results on self-made datasets.
AB - Structured light (SL)-based three-dimensional (3D) measurement techniques with deep learning have been widely studied to improve measurement efficiency, among which fringe projection profilometry (FPP) and speckle projection profilometry (SPP) are two popular methods. However, these methods generally use a single projection pattern for reconstruction, resulting in fringe order ambiguity or poor reconstruction accuracy. To alleviate these problems, we propose a parallel dual-branch Convolutional Neural Network (CNN)-Transformer network (PDCNet) that takes advantage of convolutional operations and self-attention mechanisms for processing different SL modalities. Within PDCNet, a Transformer branch is used to capture global perception in the fringe images, while a CNN branch is designed to collect local details in the speckle images. To fully integrate these complementary features, we design a double-stream attention aggregation module (DAAM) consisting of a parallel attention subnetwork that aggregates multi-scale spatial structure information. This module dynamically retains local and global representations to the maximum extent. Moreover, an adaptive mixture density head with a bimodal Gaussian distribution is proposed for learning a representation that is precise near discontinuities. Compared with the standard disparity regression strategy, this adaptive mixture head effectively improves performance at object boundaries. Extensive experiments demonstrate that our method reduces fringe order ambiguity while producing high-accuracy results on self-made datasets.
KW - Attention Mechanism
KW - Deep Learning
KW - Dual-Branch Framework
KW - Structured-Light Projection Profilometry
UR - http://www.scopus.com/inward/record.url?scp=85210123617&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2024.3502134
DO - 10.1109/TCSVT.2024.3502134
M3 - Article
AN - SCOPUS:85210123617
SN - 1051-8215
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
ER -
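
The abstract above describes a dual-branch CNN-Transformer design: a Transformer branch over the fringe image for global perception, a CNN branch over the speckle image for local detail, and a fusion module (DAAM). The paper's exact architecture is not specified in this record, so the following is only a minimal, hypothetical PyTorch sketch of the dual-branch idea; the per-pixel gate is a crude stand-in for DAAM, and all layer sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DualBranchSketch(nn.Module):
    """Toy two-branch feature extractor: Transformer on fringe, CNN on speckle."""
    def __init__(self, dim: int = 64, patch: int = 8):
        super().__init__()
        # CNN branch: local feature extractor for the speckle image
        self.cnn = nn.Sequential(
            nn.Conv2d(1, dim, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(dim, dim, 3, padding=1), nn.ReLU(inplace=True),
        )
        # Transformer branch: patch embedding + self-attention over the fringe image
        self.embed = nn.Conv2d(1, dim, kernel_size=patch, stride=patch)
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        # Fusion gate: per-pixel weighting of the two feature streams (DAAM stand-in)
        self.gate = nn.Conv2d(2 * dim, dim, kernel_size=1)

    def forward(self, fringe: torch.Tensor, speckle: torch.Tensor) -> torch.Tensor:
        local = self.cnn(speckle)                                 # (B, C, H, W)
        tokens = self.embed(fringe)                               # (B, C, H/p, W/p)
        b, c, h, w = tokens.shape
        tokens = self.encoder(tokens.flatten(2).transpose(1, 2))  # (B, N, C)
        glob = tokens.transpose(1, 2).reshape(b, c, h, w)
        glob = F.interpolate(glob, size=local.shape[-2:],
                             mode="bilinear", align_corners=False)
        weight = torch.sigmoid(self.gate(torch.cat([local, glob], dim=1)))
        return weight * local + (1 - weight) * glob               # fused features
```

The abstract also mentions an adaptive mixture density head with a bimodal Gaussian distribution in place of standard disparity regression. Below is a hedged sketch of one common way such a head is built: two mixture weights, means, and standard deviations per pixel, trained with a mixture negative log-likelihood. The channel counts and the softplus/epsilon choices are assumptions, not the paper's exact parameterization.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

class BimodalMixtureHead(nn.Module):
    """Predicts two mixture weights, means, and standard deviations per pixel."""
    def __init__(self, in_channels: int):
        super().__init__()
        self.proj = nn.Conv2d(in_channels, 6, kernel_size=1)  # 2x (pi, mu, sigma)

    def forward(self, feat: torch.Tensor):
        logits, mu, raw_sigma = self.proj(feat).split(2, dim=1)  # each (B, 2, H, W)
        pi = F.softmax(logits, dim=1)             # mixture weights sum to 1
        sigma = F.softplus(raw_sigma) + 1e-3      # strictly positive spread
        return pi, mu, sigma

def mixture_nll(pi, mu, sigma, disparity):
    """Negative log-likelihood of the ground-truth disparity under the mixture."""
    d = disparity.unsqueeze(1)                    # (B, 1, H, W) broadcasts over modes
    log_comp = (-0.5 * ((d - mu) / sigma) ** 2
                - torch.log(sigma) - 0.5 * math.log(2 * math.pi))
    return -torch.logsumexp(torch.log(pi + 1e-8) + log_comp, dim=1).mean()
```

At inference, the mean of the dominant component (or the full mixture expectation) can serve as the disparity estimate. Keeping two modes lets the network place probability mass on both sides of a depth discontinuity rather than averaging across it, which is the usual intuition behind the boundary-accuracy improvement the abstract reports.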