TY - JOUR
T1 - Learning to Predict 3D Meshes from a Single Image via Depth Consistency
AU - Huang, Hao
AU - Liu, Shaoli
AU - Liu, Jianhua
AU - Jin, Peng
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Reconstructing three-dimensional (3D) shapes from a single image remains a significant challenge in computer vision due to the inherent ambiguity caused by missing or occluded shape information. Previous studies have predominantly focused on mesh models supervised by multi-view silhouettes. However, such methods are limited in reconstructing fine details. In this study, a 3D mesh model is predicted from a single image, leveraging depth consistency and without requiring viewpoint pose annotations. The model effectively learns strong shape priors that preserve finer structures and accurately predicts view poses from "correlation-supervised" viewpoints. Additionally, standard deviation and Laplacian losses were employed to regulate mesh edge distribution, resulting in more precise reconstructions. Differentiable renderer functions were derived from the 3D mesh to generate depth maps. Compared to conventional approaches, the proposed method provided superior representation of subtle structures. When applied to both synthetic and real-world datasets, the model outperformed existing methods in view-based 3D reconstruction tasks.
AB - Reconstructing three-dimensional (3D) shapes from a single image remains a significant challenge in computer vision due to the inherent ambiguity caused by missing or occluded shape information. Previous studies have predominantly focused on mesh models supervised by multi-view silhouettes. However, such methods are limited in reconstructing fine details. In this study, a 3D mesh model is predicted from a single image, leveraging depth consistency and without requiring viewpoint pose annotations. The model effectively learns strong shape priors that preserve finer structures and accurately predicts view poses from "correlation-supervised" viewpoints. Additionally, standard deviation and Laplacian losses were employed to regulate mesh edge distribution, resulting in more precise reconstructions. Differentiable renderer functions were derived from the 3D mesh to generate depth maps. Compared to conventional approaches, the proposed method provided superior representation of subtle structures. When applied to both synthetic and real-world datasets, the model outperformed existing methods in view-based 3D reconstruction tasks.
KW - Depth-consistency
KW - Mesh
KW - Standard deviation loss
KW - View-based reconstruction
UR - https://www.scopus.com/pages/publications/105014604905
U2 - 10.1186/s10033-025-01335-2
DO - 10.1186/s10033-025-01335-2
M3 - Article
AN - SCOPUS:105014604905
SN - 1000-9345
VL - 38
JO - Chinese Journal of Mechanical Engineering (English Edition)
JF - Chinese Journal of Mechanical Engineering (English Edition)
IS - 1
M1 - 165
ER -