Unsupervised learning of depth from monocular videos using 3D-2D corresponding constraints

Fusheng Jin*, Yu Zhao, Chuanbing Wan, Ye Yuan, Shuliang Wang

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

3 Citations (Scopus)

Abstract

Depth estimation is valuable for object detection, localization, path planning, and related tasks. However, existing deep-learning-based methods demand substantial computing power and often cannot be applied directly to autonomous moving platforms (AMP). Fifth-generation (5G) mobile and wireless communication systems have attracted researchers' attention because they provide the network foundation for cloud computing and edge computing, which makes it possible to use deep learning methods on AMP. This paper proposes a depth prediction method for AMP based on unsupervised learning, which learns from video sequences and simultaneously estimates the depth structure of the scene and the ego-motion. Compared with existing unsupervised learning methods, our method makes the spatial correspondence among pixel points consistent with the image area by smoothing the 3D corresponding vector field based on the 2D image, which effectively improves the depth prediction ability of the neural network. Our experiments on the KITTI driving dataset demonstrate that our method outperforms previous learning-based methods. The results on the Apolloscape and Cityscapes datasets show that the proposed method generalizes well.
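
To make the geometric idea in the abstract concrete, the following is a minimal NumPy sketch of how a predicted depth map and an ego-motion estimate induce a 3D displacement field and a 2D reprojection, together with a generic edge-aware smoothness penalty on that field. It is an illustration under stated assumptions, not the authors' formulation: the function names, the pinhole intrinsics matrix K, the 4x4 rigid transform T, and the exponential edge weighting are all assumptions introduced here.

```python
import numpy as np

def backproject(depth, K):
    """Lift every pixel to a 3D camera-frame point: X = D * K^-1 [u, v, 1]^T."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # u: column index, v: row index
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).astype(np.float64)  # (h, w, 3)
    rays = pix @ np.linalg.inv(K).T
    return rays * depth[..., None]

def correspondence_field(depth, K, T):
    """Return the 3D displacement of each point under ego-motion T (assumed 4x4
    rigid transform) and the 2D pixel coordinates where each point reprojects."""
    pts = backproject(depth, K)
    moved = pts @ T[:3, :3].T + T[:3, 3]
    proj = moved @ K.T
    uv = proj[..., :2] / proj[..., 2:3]
    return moved - pts, uv

def edge_aware_smoothness(field, image):
    """Generic edge-aware penalty (an assumption, not the paper's loss): field
    gradients are down-weighted where the grayscale image has strong gradients."""
    dfx = np.abs(field[:, 1:] - field[:, :-1]).sum(-1)
    dfy = np.abs(field[1:, :] - field[:-1, :]).sum(-1)
    wx = np.exp(-np.abs(image[:, 1:] - image[:, :-1]))
    wy = np.exp(-np.abs(image[1:, :] - image[:-1, :]))
    return (dfx * wx).mean() + (dfy * wy).mean()
```

In a training loop of this kind, the reprojected coordinates would typically be used to warp a neighboring frame for a photometric loss, while a smoothness term regularizes the field inside image regions. How the paper combines these terms is not stated in the abstract.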

Original language: English
Article number: 1764
Journal: Remote Sensing
Volume: 13
Issue number: 9
Publication status: Published - 1 May 2021
