TY - GEN
T1 - Monocular depth estimation of outdoor scenes using RGB-D datasets
AU - Bi, Tianteng
AU - Liu, Yue
AU - Weng, Dongdong
AU - Wang, Yongtian
N1 - Publisher Copyright:
© Springer International Publishing AG 2017.
PY - 2017
Y1 - 2017
N2 - Depth estimation is a classical topic in computer vision, however, inferring the depth of a scene from a single image remains an extremely difficult problem. In this paper, a non-parametric method is adopted to obtain the depth of a single image. To this end, RGB-D datasets are exploited as the inference basis. Given a query image, a global scene-level retrieval is performed against the dataset, followed by a superpixel-level matching. The superpixels-based scene representation is introduced to model the depth jointly in terms of superpixel centroid. The depth estimation is formulated as contextual inference and the depth propagation. The contextual inference is expressed as a Markov random field (MRF) energy function defined on a sparse depth map obtained by the matching process and implemented in a graphical model whose edges encode the interactions between the superpixel centroids. Then the depth propagation generates the final dense depth map from the inferred result. The benefits of the proposed method is demonstrated on the standard dataset.
AB - Depth estimation is a classical topic in computer vision, however, inferring the depth of a scene from a single image remains an extremely difficult problem. In this paper, a non-parametric method is adopted to obtain the depth of a single image. To this end, RGB-D datasets are exploited as the inference basis. Given a query image, a global scene-level retrieval is performed against the dataset, followed by a superpixel-level matching. The superpixels-based scene representation is introduced to model the depth jointly in terms of superpixel centroid. The depth estimation is formulated as contextual inference and the depth propagation. The contextual inference is expressed as a Markov random field (MRF) energy function defined on a sparse depth map obtained by the matching process and implemented in a graphical model whose edges encode the interactions between the superpixel centroids. Then the depth propagation generates the final dense depth map from the inferred result. The benefits of the proposed method is demonstrated on the standard dataset.
UR - http://www.scopus.com/inward/record.url?scp=85016122458&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-54427-4_7
DO - 10.1007/978-3-319-54427-4_7
M3 - Conference contribution
AN - SCOPUS:85016122458
SN - 9783319544267
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 88
EP - 99
BT - Computer Vision - ACCV 2016 Workshops, ACCV 2016 International Workshops, Revised Selected Papers
A2 - Ma, Kai-Kuang
A2 - Lu, Jiwen
A2 - Chen, Chu-Song
PB - Springer Verlag
T2 - 13th Asian Conference on Computer Vision, ACCV 2016
Y2 - 20 November 2016 through 24 November 2016
ER -