TY - GEN
T1 - Multi-Convolution Neural Networks-Based Deep Learning Model for Emotion Understanding
AU - Chen, Luefeng
AU - Wu, Min
AU - Su, Wanjuan
AU - Hirota, Kaoru
N1 - Publisher Copyright:
© 2018 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2018/10/5
Y1 - 2018/10/5
N2 - A multi-convolution neural networks-based deep learning model that combines multimodal data for emotion understanding is proposed, in which facial expressions and body gestures are used to recognize emotional states. It aims to understand coexisting multimodal information in human-robot interaction by using multi-convolution neural networks, where multilayer convolutions are connected in series and multiple networks are executed in parallel. Moreover, when the weights of a deep neural network are optimized by traditional methods, training easily falls into poor local optima. To address this problem, a hybrid genetic algorithm with stochastic gradient descent is developed, which exploits the inherent implicit parallelism and stronger global optimization capability of the genetic algorithm to adaptively find better network weights. To speed up convergence, the weights optimized by stochastic gradient descent are taken as a chromosome in the genetic algorithm's initial population, serving as a priori knowledge. To verify the effectiveness of the proposal, experiments on a benchmark database of spontaneous emotion expressions are conducted, and the results show that the proposal outperforms state-of-the-art methods. Preliminary application experiments also indicate that the proposal can be extended to human-robot interaction.
AB - A multi-convolution neural networks-based deep learning model that combines multimodal data for emotion understanding is proposed, in which facial expressions and body gestures are used to recognize emotional states. It aims to understand coexisting multimodal information in human-robot interaction by using multi-convolution neural networks, where multilayer convolutions are connected in series and multiple networks are executed in parallel. Moreover, when the weights of a deep neural network are optimized by traditional methods, training easily falls into poor local optima. To address this problem, a hybrid genetic algorithm with stochastic gradient descent is developed, which exploits the inherent implicit parallelism and stronger global optimization capability of the genetic algorithm to adaptively find better network weights. To speed up convergence, the weights optimized by stochastic gradient descent are taken as a chromosome in the genetic algorithm's initial population, serving as a priori knowledge. To verify the effectiveness of the proposal, experiments on a benchmark database of spontaneous emotion expressions are conducted, and the results show that the proposal outperforms state-of-the-art methods. Preliminary application experiments also indicate that the proposal can be extended to human-robot interaction.
KW - Body gesture
KW - Convolution neural network
KW - Deep learning
KW - Emotion understanding
KW - Facial expression
UR - http://www.scopus.com/inward/record.url?scp=85056125815&partnerID=8YFLogxK
U2 - 10.23919/ChiCC.2018.8483607
DO - 10.23919/ChiCC.2018.8483607
M3 - Conference contribution
AN - SCOPUS:85056125815
T3 - Chinese Control Conference, CCC
SP - 9545
EP - 9549
BT - Proceedings of the 37th Chinese Control Conference, CCC 2018
A2 - Chen, Xin
A2 - Zhao, Qianchuan
PB - IEEE Computer Society
T2 - 37th Chinese Control Conference, CCC 2018
Y2 - 25 July 2018 through 27 July 2018
ER -