Abstract
Reconstructing three-dimensional (3D) shapes from a single image remains a significant challenge in computer vision due to the inherent ambiguity caused by missing or occluded shape information. Previous studies have predominantly focused on mesh models supervised by multi-view silhouettes. However, such methods are limited in reconstructing fine details. In this study, a 3D mesh model is predicted from a single image, leveraging depth consistency and without requiring viewpoint pose annotations. The model effectively learns strong shape priors that preserve finer structures and accurately predicts view poses from "correlation-supervised" viewpoints. Additionally, standard deviation and Laplacian losses were employed to regulate mesh edge distribution, resulting in more precise reconstructions. Differentiable renderer functions were derived from the 3D mesh to generate depth maps. Compared to conventional approaches, the proposed method provided superior representation of subtle structures. When applied to both synthetic and real-world datasets, the model outperformed existing methods in view-based 3D reconstruction tasks.
| Original language | English |
|---|---|
| Article number | 165 |
| Journal | Chinese Journal of Mechanical Engineering (English Edition) |
| Volume | 38 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Dec 2025 |
| Externally published | Yes |
Keywords
- Depth-consistency
- Mesh
- Standard deviation loss
- View-based reconstruction
Fingerprint
Dive into the research topics of 'Learning to Predict 3D Meshes from a Single Image via Depth Consistency'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver