Softmax regression based deep sparse autoencoder network for facial emotion recognition in human-robot interaction

Luefeng Chen; Mengtian Zhou; Wanjuan Su; Min Wu; Jinhua She; Kaoru Hirota

doi:10.1016/j.ins.2017.10.044

Softmax regression based deep sparse autoencoder network for facial emotion recognition in human-robot interaction

Luefeng Chen, Mengtian Zhou, Wanjuan Su, Min Wu^*, Jinhua She, Kaoru Hirota

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

178 引用（Scopus）

摘要

Deep neural network (DNN) has been used as a learning model for modeling the hierarchical architecture of human brain. However, DNN suffers from problems of learning efficiency and computational complexity. To address these problems, deep sparse autoencoder network (DSAN) is used for learning facial features, which considers the sparsity of hidden units for learning high-level structures. Meanwhile, Softmax regression (SR) is used to classify expression feature. In this paper, Softmax regression-based deep sparse autoencoder network (SRDSAN) is proposed to recognize facial emotion in human-robot interaction. It aims to handle large data in the output of deep learning by using SR, moreover, to overcome local extrema and gradient diffusion problems in the training process, the overall network weights are fine-tuned to reach the global optimum, which makes the entire depth of the neural network more robust, thereby enhancing the performance of facial emotion recognition. Results show that the average recognition accuracy of SRDSAN is higher than that of the SR and the convolutional neural network. The preliminarily application experiments are performed in the developing emotional social robot system (ESRS) with two mobile robots, where emotional social robot is able to recognize emotions such as happiness and angry.

源语言	英语
页（从-至）	49-61
页数	13
期刊	Information Sciences
卷	428
DOI	https://doi.org/10.1016/j.ins.2017.10.044
出版状态	已出版 - 2月 2018
已对外发布	是

访问文件

10.1016/j.ins.2017.10.044

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{213c13176af34d4ea34f3468f5f4c472,

title = "Softmax regression based deep sparse autoencoder network for facial emotion recognition in human-robot interaction",

abstract = "Deep neural network (DNN) has been used as a learning model for modeling the hierarchical architecture of human brain. However, DNN suffers from problems of learning efficiency and computational complexity. To address these problems, deep sparse autoencoder network (DSAN) is used for learning facial features, which considers the sparsity of hidden units for learning high-level structures. Meanwhile, Softmax regression (SR) is used to classify expression feature. In this paper, Softmax regression-based deep sparse autoencoder network (SRDSAN) is proposed to recognize facial emotion in human-robot interaction. It aims to handle large data in the output of deep learning by using SR, moreover, to overcome local extrema and gradient diffusion problems in the training process, the overall network weights are fine-tuned to reach the global optimum, which makes the entire depth of the neural network more robust, thereby enhancing the performance of facial emotion recognition. Results show that the average recognition accuracy of SRDSAN is higher than that of the SR and the convolutional neural network. The preliminarily application experiments are performed in the developing emotional social robot system (ESRS) with two mobile robots, where emotional social robot is able to recognize emotions such as happiness and angry.",

keywords = "Deep sparse autoencoder network, Facial emotion recognition, Human-robot interaction, Softmax regression",

author = "Luefeng Chen and Mengtian Zhou and Wanjuan Su and Min Wu and Jinhua She and Kaoru Hirota",

note = "Publisher Copyright: {\textcopyright} 2017",

year = "2018",

month = feb,

doi = "10.1016/j.ins.2017.10.044",

language = "English",

volume = "428",

pages = "49--61",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - Softmax regression based deep sparse autoencoder network for facial emotion recognition in human-robot interaction

AU - Chen, Luefeng

AU - Zhou, Mengtian

AU - Su, Wanjuan

AU - Wu, Min

AU - She, Jinhua

AU - Hirota, Kaoru

PY - 2018/2

Y1 - 2018/2

N2 - Deep neural network (DNN) has been used as a learning model for modeling the hierarchical architecture of human brain. However, DNN suffers from problems of learning efficiency and computational complexity. To address these problems, deep sparse autoencoder network (DSAN) is used for learning facial features, which considers the sparsity of hidden units for learning high-level structures. Meanwhile, Softmax regression (SR) is used to classify expression feature. In this paper, Softmax regression-based deep sparse autoencoder network (SRDSAN) is proposed to recognize facial emotion in human-robot interaction. It aims to handle large data in the output of deep learning by using SR, moreover, to overcome local extrema and gradient diffusion problems in the training process, the overall network weights are fine-tuned to reach the global optimum, which makes the entire depth of the neural network more robust, thereby enhancing the performance of facial emotion recognition. Results show that the average recognition accuracy of SRDSAN is higher than that of the SR and the convolutional neural network. The preliminarily application experiments are performed in the developing emotional social robot system (ESRS) with two mobile robots, where emotional social robot is able to recognize emotions such as happiness and angry.

AB - Deep neural network (DNN) has been used as a learning model for modeling the hierarchical architecture of human brain. However, DNN suffers from problems of learning efficiency and computational complexity. To address these problems, deep sparse autoencoder network (DSAN) is used for learning facial features, which considers the sparsity of hidden units for learning high-level structures. Meanwhile, Softmax regression (SR) is used to classify expression feature. In this paper, Softmax regression-based deep sparse autoencoder network (SRDSAN) is proposed to recognize facial emotion in human-robot interaction. It aims to handle large data in the output of deep learning by using SR, moreover, to overcome local extrema and gradient diffusion problems in the training process, the overall network weights are fine-tuned to reach the global optimum, which makes the entire depth of the neural network more robust, thereby enhancing the performance of facial emotion recognition. Results show that the average recognition accuracy of SRDSAN is higher than that of the SR and the convolutional neural network. The preliminarily application experiments are performed in the developing emotional social robot system (ESRS) with two mobile robots, where emotional social robot is able to recognize emotions such as happiness and angry.

KW - Deep sparse autoencoder network

KW - Facial emotion recognition

KW - Human-robot interaction

KW - Softmax regression

UR - http://www.scopus.com/inward/record.url?scp=85032584576&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2017.10.044

DO - 10.1016/j.ins.2017.10.044

M3 - Article

AN - SCOPUS:85032584576

SN - 0020-0255

VL - 428

SP - 49

EP - 61

JO - Information Sciences

JF - Information Sciences

ER -

Softmax regression based deep sparse autoencoder network for facial emotion recognition in human-robot interaction

摘要

访问文件

其它文件与链接

指纹

引用此