Cross-spectral stereo matching for facial disparity estimation in the dark

Songnan Lin; Jiawei Zhang; Jing Chen; Yongtian Wang; Yicun Liu; Jimmy Ren

doi:10.1016/j.cviu.2020.103046

Cross-spectral stereo matching for facial disparity estimation in the dark

Songnan Lin, Jiawei Zhang^*, Jing Chen, Yongtian Wang, Yicun Liu, Jimmy Ren

^*此作品的通讯作者

光电学院

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Numerous applications on human faces hinge on depth information. Often, facial stereo matching provides an opportunity to estimate disparity without active projectors. However, existing algorithms are less effective at night due to unclear texture and severe noises in RGB images. In this paper, we address this problem by estimating facial disparity maps from NIR-RGB pairs. We develop a neural network composed of a multi-spectral transfer network (MSTN) and a disparity estimation network (DEN). MSTN is used to produce a pseudo-NIR image aligned with the RGB view using a spatially weighted sum on the NIR one by a kernel prediction network (KPN). As the pseudo-NIR and the NIR images share the same appearance, the facial disparity map is predicted by the proposed DEN with the same-spectral stereo pair. The whole network can be trained in an end-to-end manner and the experimental results demonstrate that it performs favorably against state-of-the-art algorithms on both synthetic and real data.

源语言	英语
文章编号	103046
期刊	Computer Vision and Image Understanding
卷	200
DOI	https://doi.org/10.1016/j.cviu.2020.103046
出版状态	已出版 - 11月 2020

访问文件

10.1016/j.cviu.2020.103046

其它文件与链接

链接到 Scopus 的出版物

引用此

Lin, S., Zhang, J., Chen, J., Wang, Y., Liu, Y., & Ren, J. (2020). Cross-spectral stereo matching for facial disparity estimation in the dark. Computer Vision and Image Understanding, 200, 文章 103046. https://doi.org/10.1016/j.cviu.2020.103046

@article{a1d7e57a59284b299af6165435ec9208,

title = "Cross-spectral stereo matching for facial disparity estimation in the dark",

abstract = "Numerous applications on human faces hinge on depth information. Often, facial stereo matching provides an opportunity to estimate disparity without active projectors. However, existing algorithms are less effective at night due to unclear texture and severe noises in RGB images. In this paper, we address this problem by estimating facial disparity maps from NIR-RGB pairs. We develop a neural network composed of a multi-spectral transfer network (MSTN) and a disparity estimation network (DEN). MSTN is used to produce a pseudo-NIR image aligned with the RGB view using a spatially weighted sum on the NIR one by a kernel prediction network (KPN). As the pseudo-NIR and the NIR images share the same appearance, the facial disparity map is predicted by the proposed DEN with the same-spectral stereo pair. The whole network can be trained in an end-to-end manner and the experimental results demonstrate that it performs favorably against state-of-the-art algorithms on both synthetic and real data.",

keywords = "Deep learning, Facial disparity estimation, Multi-spectral transfer",

author = "Songnan Lin and Jiawei Zhang and Jing Chen and Yongtian Wang and Yicun Liu and Jimmy Ren",

note = "Publisher Copyright: {\textcopyright} 2020 Elsevier Inc.",

year = "2020",

month = nov,

doi = "10.1016/j.cviu.2020.103046",

language = "English",

volume = "200",

journal = "Computer Vision and Image Understanding",

issn = "1077-3142",

publisher = "Academic Press Inc.",

}

TY - JOUR

T1 - Cross-spectral stereo matching for facial disparity estimation in the dark

AU - Lin, Songnan

AU - Zhang, Jiawei

AU - Chen, Jing

AU - Wang, Yongtian

AU - Liu, Yicun

AU - Ren, Jimmy

PY - 2020/11

Y1 - 2020/11

N2 - Numerous applications on human faces hinge on depth information. Often, facial stereo matching provides an opportunity to estimate disparity without active projectors. However, existing algorithms are less effective at night due to unclear texture and severe noises in RGB images. In this paper, we address this problem by estimating facial disparity maps from NIR-RGB pairs. We develop a neural network composed of a multi-spectral transfer network (MSTN) and a disparity estimation network (DEN). MSTN is used to produce a pseudo-NIR image aligned with the RGB view using a spatially weighted sum on the NIR one by a kernel prediction network (KPN). As the pseudo-NIR and the NIR images share the same appearance, the facial disparity map is predicted by the proposed DEN with the same-spectral stereo pair. The whole network can be trained in an end-to-end manner and the experimental results demonstrate that it performs favorably against state-of-the-art algorithms on both synthetic and real data.

AB - Numerous applications on human faces hinge on depth information. Often, facial stereo matching provides an opportunity to estimate disparity without active projectors. However, existing algorithms are less effective at night due to unclear texture and severe noises in RGB images. In this paper, we address this problem by estimating facial disparity maps from NIR-RGB pairs. We develop a neural network composed of a multi-spectral transfer network (MSTN) and a disparity estimation network (DEN). MSTN is used to produce a pseudo-NIR image aligned with the RGB view using a spatially weighted sum on the NIR one by a kernel prediction network (KPN). As the pseudo-NIR and the NIR images share the same appearance, the facial disparity map is predicted by the proposed DEN with the same-spectral stereo pair. The whole network can be trained in an end-to-end manner and the experimental results demonstrate that it performs favorably against state-of-the-art algorithms on both synthetic and real data.

KW - Deep learning

KW - Facial disparity estimation

KW - Multi-spectral transfer

UR - http://www.scopus.com/inward/record.url?scp=85088904811&partnerID=8YFLogxK

U2 - 10.1016/j.cviu.2020.103046

DO - 10.1016/j.cviu.2020.103046

M3 - Article

AN - SCOPUS:85088904811

SN - 1077-3142

VL - 200

JO - Computer Vision and Image Understanding

JF - Computer Vision and Image Understanding

M1 - 103046

ER -

Cross-spectral stereo matching for facial disparity estimation in the dark

摘要

访问文件

其它文件与链接

指纹

引用此