TY - JOUR
T1 - Facial Expression Recognition Using Hybrid Features of Pixel and Geometry
AU - Liu, Chang
AU - Hirota, Kaoru
AU - Ma, Junjie
AU - Jia, Zhiyang
AU - Dai, Yaping
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - Facial Expression Recognition (FER) has long been a challenging task in the field of computer vision. Most existing FER methods extract facial features on the basis of face pixels, ignoring the relative geometric position dependencies of facial landmark points. This article presents a hybrid feature extraction network to enhance the discriminative power of emotional features. The proposed network consists of a Spatial Attention Convolutional Neural Network (SACNN) and a series of Long Short-Term Memory networks with Attention mechanism (ALSTMs). The SACNN is employed to extract expressional features from static face images, and the ALSTMs are designed to explore the potential of facial landmarks for expression recognition. A deep geometric feature descriptor is proposed to characterize the relative geometric position correlation of facial landmarks. The landmarks are divided into seven groups to extract deep geometric features, and the attention module in the ALSTMs adaptively estimates the importance of different landmark regions. By jointly combining the SACNN and ALSTMs, hybrid features are obtained for expression recognition. Experiments conducted on three public databases, FER2013, CK+, and JAFFE, demonstrate that the proposed method outperforms previous methods, with accuracies of 74.31%, 95.15%, and 98.57%, respectively. Preliminary results from the Emotion Understanding Robot System (EURS) indicate that the proposed method has the potential to improve the performance of human-robot interaction.
AB - Facial Expression Recognition (FER) has long been a challenging task in the field of computer vision. Most existing FER methods extract facial features on the basis of face pixels, ignoring the relative geometric position dependencies of facial landmark points. This article presents a hybrid feature extraction network to enhance the discriminative power of emotional features. The proposed network consists of a Spatial Attention Convolutional Neural Network (SACNN) and a series of Long Short-Term Memory networks with Attention mechanism (ALSTMs). The SACNN is employed to extract expressional features from static face images, and the ALSTMs are designed to explore the potential of facial landmarks for expression recognition. A deep geometric feature descriptor is proposed to characterize the relative geometric position correlation of facial landmarks. The landmarks are divided into seven groups to extract deep geometric features, and the attention module in the ALSTMs adaptively estimates the importance of different landmark regions. By jointly combining the SACNN and ALSTMs, hybrid features are obtained for expression recognition. Experiments conducted on three public databases, FER2013, CK+, and JAFFE, demonstrate that the proposed method outperforms previous methods, with accuracies of 74.31%, 95.15%, and 98.57%, respectively. Preliminary results from the Emotion Understanding Robot System (EURS) indicate that the proposed method has the potential to improve the performance of human-robot interaction.
KW - Facial expression recognition
KW - attention mechanism
KW - hybrid feature
KW - long short-term memory network
KW - relative geometric position dependency
UR - http://www.scopus.com/inward/record.url?scp=85100517904&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2021.3054332
DO - 10.1109/ACCESS.2021.3054332
M3 - Article
AN - SCOPUS:85100517904
SN - 2169-3536
VL - 9
SP - 18876
EP - 18889
JO - IEEE Access
JF - IEEE Access
M1 - 9335586
ER -