Skip to main navigation Skip to search Skip to main content

Learning to Predict 3D Meshes from a Single Image via Depth Consistency

  • Beijing Institute of Technology
  • Shenzhen Bay Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

Reconstructing three-dimensional (3D) shapes from a single image remains a significant challenge in computer vision due to the inherent ambiguity caused by missing or occluded shape information. Previous studies have predominantly focused on mesh models supervised by multi-view silhouettes. However, such methods are limited in reconstructing fine details. In this study, a 3D mesh model is predicted from a single image, leveraging depth consistency and without requiring viewpoint pose annotations. The model effectively learns strong shape priors that preserve finer structures and accurately predicts view poses from "correlation-supervised" viewpoints. Additionally, standard deviation and Laplacian losses were employed to regulate mesh edge distribution, resulting in more precise reconstructions. Differentiable renderer functions were derived from the 3D mesh to generate depth maps. Compared to conventional approaches, the proposed method provided superior representation of subtle structures. When applied to both synthetic and real-world datasets, the model outperformed existing methods in view-based 3D reconstruction tasks.

Original languageEnglish
Article number165
JournalChinese Journal of Mechanical Engineering (English Edition)
Volume38
Issue number1
DOIs
Publication statusPublished - Dec 2025
Externally publishedYes

Keywords

  • Depth-consistency
  • Mesh
  • Standard deviation loss
  • View-based reconstruction

Fingerprint

Dive into the research topics of 'Learning to Predict 3D Meshes from a Single Image via Depth Consistency'. Together they form a unique fingerprint.

Cite this