TY - JOUR
T1 - Weakly-Supervised Single-view Dense 3D Point Cloud Reconstruction via Differentiable Renderer
AU - Jin, Peng
AU - Liu, Shaoli
AU - Liu, Jianhua
AU - Huang, Hao
AU - Yang, Linlin
AU - Weinmann, Michael
AU - Klein, Reinhard
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/12
Y1 - 2021/12
N2 - In recent years, addressing ill-posed problems by leveraging prior knowledge contained in databases through learning techniques has gained much attention. In this paper, we focus on complete three-dimensional (3D) point cloud reconstruction from a single red-green-blue (RGB) image, a task that cannot be approached using classical reconstruction techniques. For this purpose, we use an encoder-decoder framework to encode the RGB information in latent space and to predict the 3D structure of the considered object from different viewpoints. The individual predictions are combined to yield a common representation that is used in a module combining camera pose estimation and rendering, thereby achieving differentiability with respect to the imaging process and the camera pose, and enabling optimization of the two-dimensional prediction error for novel viewpoints. Thus, our method allows end-to-end training and does not require supervision based on additional ground-truth (GT) mask annotations or ground-truth camera pose annotations. Our evaluation on synthetic and real-world data demonstrates the robustness of our approach to appearance changes and self-occlusions, as it outperforms current state-of-the-art methods in terms of accuracy, density, and model completeness.
AB - In recent years, addressing ill-posed problems by leveraging prior knowledge contained in databases through learning techniques has gained much attention. In this paper, we focus on complete three-dimensional (3D) point cloud reconstruction from a single red-green-blue (RGB) image, a task that cannot be approached using classical reconstruction techniques. For this purpose, we use an encoder-decoder framework to encode the RGB information in latent space and to predict the 3D structure of the considered object from different viewpoints. The individual predictions are combined to yield a common representation that is used in a module combining camera pose estimation and rendering, thereby achieving differentiability with respect to the imaging process and the camera pose, and enabling optimization of the two-dimensional prediction error for novel viewpoints. Thus, our method allows end-to-end training and does not require supervision based on additional ground-truth (GT) mask annotations or ground-truth camera pose annotations. Our evaluation on synthetic and real-world data demonstrates the robustness of our approach to appearance changes and self-occlusions, as it outperforms current state-of-the-art methods in terms of accuracy, density, and model completeness.
KW - Differentiable renderer
KW - Neural networks
KW - Point cloud reconstruction
KW - Single-view configuration
UR - http://www.scopus.com/inward/record.url?scp=85116006228&partnerID=8YFLogxK
U2 - 10.1186/s10033-021-00615-x
DO - 10.1186/s10033-021-00615-x
M3 - Article
AN - SCOPUS:85116006228
SN - 1000-9345
VL - 34
JO - Chinese Journal of Mechanical Engineering (English Edition)
JF - Chinese Journal of Mechanical Engineering (English Edition)
IS - 1
M1 - 93
ER -