Facial component-landmark detection with weakly-supervised LR-CNN

Ruiheng Zhang; Chengpo Mu; Min Xu; Lixin Xu; Xiaofeng Xu

doi:10.1109/ACCESS.2018.2890573

Facial component-landmark detection with weakly-supervised LR-CNN

Ruiheng Zhang, Chengpo Mu, Min Xu^*, Lixin Xu, Xiaofeng Xu

^*Corresponding author for this work

School of Mechatronical Engineering

Research output: Contribution to journal › Article › peer-review

10 Citations (Scopus)

Abstract

In this paper, we propose a weakly supervised landmark-region-based convolutional neural network (LR-CNN) framework to detect facial component and landmark simultaneously. Most of the existing course-to-fine facial detectors fail to detect landmark accurately without lots of fully labeled data, which are costly to obtain. We can handle the task with a small amount of finely labeled data. First, deep convolutional generative adversarial networks are utilized to generate training samples with weak labels, as data preparation. Then, through weakly supervised learning, our LR-CNN model can be trained effectively with a small amount of finely labeled data and a large amount of generated weakly labeled data. Notably, our approach can handle the situation when large occlusion areas occur, as we localize visible facial components before predicting corresponding landmarks. Detecting unblocked components first helps us to focus on the informative area, resulting in a better performance. Additionally, to improve the performance of the above tasks, we design two models as follows: 1) we add AnchorAlign in the region proposal networks to accurately localize components and 2) we propose a two-branch model consisting classification branch and regression branch to detect landmark. Extensive evaluations on benchmark datasets indicate that our proposed approach is able to complete the multi-task facial detection and outperforms the state-of-the-art facial component and landmark detection algorithms.

Original language	English
Article number	8598858
Pages (from-to)	10263-10277
Number of pages	15
Journal	IEEE Access
Volume	7
DOIs	https://doi.org/10.1109/ACCESS.2018.2890573
Publication status	Published - 2019

Keywords

Weakly-supervised
facial landmark
generative adversarial network
region-based convolutional neural network

Access to Document

10.1109/ACCESS.2018.2890573

Cite this

@article{f0e6b3d3afa94dd1b003205605914f0d,

title = "Facial component-landmark detection with weakly-supervised LR-CNN",

abstract = "In this paper, we propose a weakly supervised landmark-region-based convolutional neural network (LR-CNN) framework to detect facial component and landmark simultaneously. Most of the existing course-to-fine facial detectors fail to detect landmark accurately without lots of fully labeled data, which are costly to obtain. We can handle the task with a small amount of finely labeled data. First, deep convolutional generative adversarial networks are utilized to generate training samples with weak labels, as data preparation. Then, through weakly supervised learning, our LR-CNN model can be trained effectively with a small amount of finely labeled data and a large amount of generated weakly labeled data. Notably, our approach can handle the situation when large occlusion areas occur, as we localize visible facial components before predicting corresponding landmarks. Detecting unblocked components first helps us to focus on the informative area, resulting in a better performance. Additionally, to improve the performance of the above tasks, we design two models as follows: 1) we add AnchorAlign in the region proposal networks to accurately localize components and 2) we propose a two-branch model consisting classification branch and regression branch to detect landmark. Extensive evaluations on benchmark datasets indicate that our proposed approach is able to complete the multi-task facial detection and outperforms the state-of-the-art facial component and landmark detection algorithms.",

keywords = "Weakly-supervised, facial landmark, generative adversarial network, region-based convolutional neural network",

author = "Ruiheng Zhang and Chengpo Mu and Min Xu and Lixin Xu and Xiaofeng Xu",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2019",

doi = "10.1109/ACCESS.2018.2890573",

language = "English",

volume = "7",

pages = "10263--10277",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Facial component-landmark detection with weakly-supervised LR-CNN

AU - Zhang, Ruiheng

AU - Mu, Chengpo

AU - Xu, Min

AU - Xu, Lixin

AU - Xu, Xiaofeng

PY - 2019

Y1 - 2019

N2 - In this paper, we propose a weakly supervised landmark-region-based convolutional neural network (LR-CNN) framework to detect facial component and landmark simultaneously. Most of the existing course-to-fine facial detectors fail to detect landmark accurately without lots of fully labeled data, which are costly to obtain. We can handle the task with a small amount of finely labeled data. First, deep convolutional generative adversarial networks are utilized to generate training samples with weak labels, as data preparation. Then, through weakly supervised learning, our LR-CNN model can be trained effectively with a small amount of finely labeled data and a large amount of generated weakly labeled data. Notably, our approach can handle the situation when large occlusion areas occur, as we localize visible facial components before predicting corresponding landmarks. Detecting unblocked components first helps us to focus on the informative area, resulting in a better performance. Additionally, to improve the performance of the above tasks, we design two models as follows: 1) we add AnchorAlign in the region proposal networks to accurately localize components and 2) we propose a two-branch model consisting classification branch and regression branch to detect landmark. Extensive evaluations on benchmark datasets indicate that our proposed approach is able to complete the multi-task facial detection and outperforms the state-of-the-art facial component and landmark detection algorithms.

AB - In this paper, we propose a weakly supervised landmark-region-based convolutional neural network (LR-CNN) framework to detect facial component and landmark simultaneously. Most of the existing course-to-fine facial detectors fail to detect landmark accurately without lots of fully labeled data, which are costly to obtain. We can handle the task with a small amount of finely labeled data. First, deep convolutional generative adversarial networks are utilized to generate training samples with weak labels, as data preparation. Then, through weakly supervised learning, our LR-CNN model can be trained effectively with a small amount of finely labeled data and a large amount of generated weakly labeled data. Notably, our approach can handle the situation when large occlusion areas occur, as we localize visible facial components before predicting corresponding landmarks. Detecting unblocked components first helps us to focus on the informative area, resulting in a better performance. Additionally, to improve the performance of the above tasks, we design two models as follows: 1) we add AnchorAlign in the region proposal networks to accurately localize components and 2) we propose a two-branch model consisting classification branch and regression branch to detect landmark. Extensive evaluations on benchmark datasets indicate that our proposed approach is able to complete the multi-task facial detection and outperforms the state-of-the-art facial component and landmark detection algorithms.

KW - Weakly-supervised

KW - facial landmark

KW - generative adversarial network

KW - region-based convolutional neural network

UR - http://www.scopus.com/inward/record.url?scp=85061086645&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2018.2890573

DO - 10.1109/ACCESS.2018.2890573

M3 - Article

AN - SCOPUS:85061086645

SN - 2169-3536

VL - 7

SP - 10263

EP - 10277

JO - IEEE Access

JF - IEEE Access

M1 - 8598858

ER -

Facial component-landmark detection with weakly-supervised LR-CNN

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this