DEMVSNet: Denoising and depth inference for unstructured multi-view stereo on noised images

Jiawei Han, Xiaomei Chen*, Yongtian Zhang, Weimin Hou, Zibo Hu

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

2 引用 (Scopus)

摘要

Most deep-learning-based multi-view stereo series studies are concerned with improving the depth prediction accuracy of noise-free images. However, it is difficult to obtain off-the-set clean images in practice and 3D convolutional neural networks require a lot of computing resources. To make full use of its computing power, different types of information can be processed simultaneously in the network. For these two issues, this paper proposes a novel multi-stage network architecture to address depth inference and denoising simultaneously. Specifically, 2D feature maps are first converted into 3D cost volumes containing pixel information and depth information through differentiable homography and Gaussian probability mapping. Then, the cost volume is input into the regularisation module in each network stage to obtain the predicted probability volumes. Furthermore, simple static weights lead to training failure, and it is necessary to dynamically adjust the loss function by gradient normalisation. The proposed method can dispose of pixel information and depth information simultaneously and both reach an excellent level. Extensive experimental results show that the authors’ work surpasses the state-of-the-art denoising on the DTU dataset (adding Gaussian–Poisson noise) and is more robust to noise images in depth inference.

源语言英语
页(从-至)570-580
页数11
期刊IET Computer Vision
16
7
DOI
出版状态已出版 - 10月 2022

指纹

探究 'DEMVSNet: Denoising and depth inference for unstructured multi-view stereo on noised images' 的科研主题。它们共同构成独一无二的指纹。

引用此