Splatting-Based View Synthesis for Self-supervised Monocular Depth Estimation

Jiahao Liu, Jianghao Leng, Bo Liu, Wenyi Huang, Chao Sun*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Self-supervised method has shown great potential in monocular depth estimation, since it does not need expensive ground-truth depth labels but only uses the photometric error of synthesized images as the supervision signal. However, although many methods have been proposed to improve its performance, the occlusion problem has not been clearly handled. This paper introduces a novel view synthesis module to deal with occluded pixels in the process of image reconstruction. Specifically, we use bilinear splatting to forward warp the source image, and average pixels projected to the same location by the predicted depth. In addition, a valid pixel mask is generated with projection to ignore invalid pixels. The proposed approach clearly handles overlapping pixels and invalid areas of the synthesized image, thus improving the performance of self-supervised learning. We conduct various experiments, and the results show that our model can generate clear and complete depth maps and achieves state-of-the-art performance.

Original languageEnglish
Title of host publication2023 9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages274-279
Number of pages6
ISBN (Electronic)9781665491204
DOIs
Publication statusPublished - 2023
Event9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023 - Wuhan, China
Duration: 24 Feb 202326 Feb 2023

Publication series

Name2023 9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023

Conference

Conference9th International Conference on Electrical Engineering, Control and Robotics, EECR 2023
Country/TerritoryChina
CityWuhan
Period24/02/2326/02/23

Keywords

  • KITTI dataset
  • monocular depth estimation
  • self-supervised learning
  • view synthesis

Fingerprint

Dive into the research topics of 'Splatting-Based View Synthesis for Self-supervised Monocular Depth Estimation'. Together they form a unique fingerprint.

Cite this