3D shape reconstruction from images in the frequency domain

Weichao Shen; Yunde Jia; Yuwei Wu

doi:10.1109/CVPR.2019.00460

3D shape reconstruction from images in the frequency domain

Weichao Shen, Yunde Jia, Yuwei Wu^*

^*此作品的通讯作者

计算机学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

11 引用（Scopus）

摘要

Reconstructing the high-resolution volumetric 3D shape from images is challenging due to the cubic growth of computational cost. In this paper, we propose a Fourier-based method that reconstructs a 3D shape from images in a 2D space by predicting slices in the frequency domain. According to the Fourier slice projection theorem, we introduce a thickness map to bridge the domain gap between images in the spatial domain and slices in the frequency domain. The thickness map is the 2D spatial projection of the 3D shape, which is easily predicted from the input image by a general convolutional neural network. Each slice in the frequency domain is the Fourier transform of the corresponding thickness map. All slices constitute a 3D descriptor and the 3D shape is the inverse Fourier transform of the descriptor. Using slices in the frequency domain, our method can transfer the 3D shape reconstruction from the 3D space into the 2D space, which significantly reduces the computational cost. The experiment results on the ShapeNet dataset demonstrate that our method achieves competitive reconstruction accuracy and computational efficiency compared with the state-of-the-art reconstruction methods.

源语言	英语
主期刊名	Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
出版商	IEEE Computer Society
页	4466-4474
页数	9
ISBN（电子版）	9781728132938
DOI	https://doi.org/10.1109/CVPR.2019.00460
出版状态	已出版 - 6月 2019
活动	32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 - Long Beach, 美国期限: 16 6月 2019 → 20 6月 2019

出版系列

姓名	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
卷	2019-June
ISSN（印刷版）	1063-6919

会议

会议	32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
国家/地区	美国
市	Long Beach
时期	16/06/19 → 20/06/19

访问文件

10.1109/CVPR.2019.00460

其它文件与链接

链接到 Scopus 的出版物

引用此

Shen, W., Jia, Y., & Wu, Y. (2019). 3D shape reconstruction from images in the frequency domain. 在 Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 (页码 4466-4474). 文章 8953854 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2019-June). IEEE Computer Society. https://doi.org/10.1109/CVPR.2019.00460

@inproceedings{3193132418fc413294852262f66483b5,

title = "3D shape reconstruction from images in the frequency domain",

abstract = "Reconstructing the high-resolution volumetric 3D shape from images is challenging due to the cubic growth of computational cost. In this paper, we propose a Fourier-based method that reconstructs a 3D shape from images in a 2D space by predicting slices in the frequency domain. According to the Fourier slice projection theorem, we introduce a thickness map to bridge the domain gap between images in the spatial domain and slices in the frequency domain. The thickness map is the 2D spatial projection of the 3D shape, which is easily predicted from the input image by a general convolutional neural network. Each slice in the frequency domain is the Fourier transform of the corresponding thickness map. All slices constitute a 3D descriptor and the 3D shape is the inverse Fourier transform of the descriptor. Using slices in the frequency domain, our method can transfer the 3D shape reconstruction from the 3D space into the 2D space, which significantly reduces the computational cost. The experiment results on the ShapeNet dataset demonstrate that our method achieves competitive reconstruction accuracy and computational efficiency compared with the state-of-the-art reconstruction methods.",

keywords = "3D from Single Image",

author = "Weichao Shen and Yunde Jia and Yuwei Wu",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 ; Conference date: 16-06-2019 Through 20-06-2019",

year = "2019",

month = jun,

doi = "10.1109/CVPR.2019.00460",

language = "English",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "4466--4474",

booktitle = "Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019",

address = "United States",

}

Shen, W, Jia, Y & Wu, Y 2019, 3D shape reconstruction from images in the frequency domain. 在 Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019., 8953854, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 卷 2019-June, IEEE Computer Society, 页码 4466-4474, 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019, Long Beach, 美国, 16/06/19. https://doi.org/10.1109/CVPR.2019.00460

3D shape reconstruction from images in the frequency domain. / Shen, Weichao; Jia, Yunde; Wu, Yuwei.
Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019. IEEE Computer Society, 2019. 页码 4466-4474 8953854 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2019-June).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - 3D shape reconstruction from images in the frequency domain

AU - Shen, Weichao

AU - Jia, Yunde

AU - Wu, Yuwei

PY - 2019/6

Y1 - 2019/6

N2 - Reconstructing the high-resolution volumetric 3D shape from images is challenging due to the cubic growth of computational cost. In this paper, we propose a Fourier-based method that reconstructs a 3D shape from images in a 2D space by predicting slices in the frequency domain. According to the Fourier slice projection theorem, we introduce a thickness map to bridge the domain gap between images in the spatial domain and slices in the frequency domain. The thickness map is the 2D spatial projection of the 3D shape, which is easily predicted from the input image by a general convolutional neural network. Each slice in the frequency domain is the Fourier transform of the corresponding thickness map. All slices constitute a 3D descriptor and the 3D shape is the inverse Fourier transform of the descriptor. Using slices in the frequency domain, our method can transfer the 3D shape reconstruction from the 3D space into the 2D space, which significantly reduces the computational cost. The experiment results on the ShapeNet dataset demonstrate that our method achieves competitive reconstruction accuracy and computational efficiency compared with the state-of-the-art reconstruction methods.

AB - Reconstructing the high-resolution volumetric 3D shape from images is challenging due to the cubic growth of computational cost. In this paper, we propose a Fourier-based method that reconstructs a 3D shape from images in a 2D space by predicting slices in the frequency domain. According to the Fourier slice projection theorem, we introduce a thickness map to bridge the domain gap between images in the spatial domain and slices in the frequency domain. The thickness map is the 2D spatial projection of the 3D shape, which is easily predicted from the input image by a general convolutional neural network. Each slice in the frequency domain is the Fourier transform of the corresponding thickness map. All slices constitute a 3D descriptor and the 3D shape is the inverse Fourier transform of the descriptor. Using slices in the frequency domain, our method can transfer the 3D shape reconstruction from the 3D space into the 2D space, which significantly reduces the computational cost. The experiment results on the ShapeNet dataset demonstrate that our method achieves competitive reconstruction accuracy and computational efficiency compared with the state-of-the-art reconstruction methods.

KW - 3D from Single Image

UR - http://www.scopus.com/inward/record.url?scp=85078769633&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2019.00460

DO - 10.1109/CVPR.2019.00460

M3 - Conference contribution

AN - SCOPUS:85078769633

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 4466

EP - 4474

BT - Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019

PB - IEEE Computer Society

T2 - 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019

Y2 - 16 June 2019 through 20 June 2019

ER -

3D shape reconstruction from images in the frequency domain

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此