Abstract
Self-supervised method has shown great potential in monocular depth estimation, since it does not need expensive ground-truth depth labels but only uses the photometric error of synthesized images as the supervision signal. However, although many methods have been proposed to improve its performance, the occlusion problem has not been clearly handled. This paper introduces a novel view synthesis module to deal with occluded pixels in the process of image reconstruction. Specifically, we use bilinear splatting to forward warp the source image, and average pixels projected to the same location by the predicted depth. In addition, a valid pixel mask is generated with projection to ignore invalid pixels. The proposed approach clearly handles overlapping pixels and invalid areas of the synthesized image, thus improving the performance of self-supervised learning. We conduct various experiments, and the results show that our model can generate clear and complete depth maps and achieves state-of-the-art performance.
| Original language | English |
|---|---|
| Title of host publication | 2023 9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 274-279 |
| Number of pages | 6 |
| ISBN (Electronic) | 9781665491204 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023 - Wuhan, China Duration: 24 Feb 2023 → 26 Feb 2023 |
Publication series
| Name | 2023 9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023 |
|---|
Conference
| Conference | 9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023 |
|---|---|
| Country/Territory | China |
| City | Wuhan |
| Period | 24/02/23 → 26/02/23 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- KITTI dataset
- monocular depth estimation
- self-supervised learning
- view synthesis
Fingerprint
Dive into the research topics of 'Splatting-Based View Synthesis for Self-supervised Monocular Depth Estimation'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver