Two-Stage Fuzzy Fusion Based-Convolution Neural Network for Dynamic Emotion Recognition

Luefeng Chen; Min Wu; Witold Pedrycz; Kaoru Hirota

doi:10.1007/978-3-030-61577-2_7

Corresponding author: Luefeng Chen

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

1 Citation (Scopus)

Abstract

The two-stage fuzzy fusion based-convolution neural network is proposed for dynamic emotion recognition by using both facial expression and speech modalities, which not only can extract discriminative emotion features which contain spatio-temporal information, but can also effectively fuse facial expression and speech modalities. Moreover, the proposal is able to handle situations where the contributions of each modality data to emotion recognition are very imbalanced. The local binary patterns coming from three orthogonal planes and spectrogram are considered first to extract low-level dynamic emotion, so that the spatio-temporal information of these modalities can be obtained. To reveal more discriminative features, two deep convolution neural networks are constructed to extract high-level emotion semantic features. Moreover, the two stage fuzzy fusion strategy is developed by integrating canonical correlation analysis and fuzzy broad learning system, so as to take into account the correlation and difference between different modal features, as well as handle the ambiguity of emotional state information.
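
The abstract names canonical correlation analysis (CCA) as one ingredient of the fusion strategy. The following is a minimal sketch of how CCA can align two feature sets before they are combined; it is not the authors' implementation. The function name `cca_fuse`, the ridge term `reg`, and the choice to concatenate the projected features are illustrative assumptions, and the fuzzy broad learning system stage is not shown.

```python
import numpy as np


def cca_fuse(face_feats, speech_feats, k, reg=1e-6):
    """Project two feature sets onto their top-k canonical directions.

    face_feats:   (n_samples, p) array, e.g. features from a face CNN (assumed)
    speech_feats: (n_samples, q) array, e.g. features from a speech CNN (assumed)
    Returns the concatenated projections and the top-k canonical correlations.
    """
    # Center each modality.
    Xc = face_feats - face_feats.mean(axis=0)
    Yc = speech_feats - speech_feats.mean(axis=0)
    n = Xc.shape[0]

    # Covariance blocks; the small ridge term keeps them positive definite.
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(Xc.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Yc.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    # Whiten both sides with Cholesky factors: T = Lx^{-1} Sxy Ly^{-T}.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    T = np.linalg.solve(Lx, Sxy)
    T = np.linalg.solve(Ly, T.T).T

    # Singular values of T are the canonical correlations.
    U, corrs, Vt = np.linalg.svd(T, full_matrices=False)

    # Map the singular vectors back to the original feature spaces.
    Wx = np.linalg.solve(Lx.T, U[:, :k])
    Wy = np.linalg.solve(Ly.T, Vt[:k].T)

    # Assumed fusion rule: concatenate the two projected views.
    fused = np.hstack([Xc @ Wx, Yc @ Wy])
    return fused, corrs[:k]
```

Calling `cca_fuse(X, Y, k=2)` on two arrays with the same number of rows returns an `(n_samples, 4)` fused matrix plus the two largest canonical correlations, which indicate how strongly the two modalities agree along each retained direction.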

Original language	English
Title of host publication	Studies in Computational Intelligence
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	91-114
Number of pages	24
DOI	https://doi.org/10.1007/978-3-030-61577-2_7
Publication status	Published - 2021
Externally published	Yes

Publication series

Name	Studies in Computational Intelligence
Volume	926
ISSN (Print)	1860-949X
ISSN (Electronic)	1860-9503

Cite this

Chen, L., Wu, M., Pedrycz, W., & Hirota, K. (2021). Two-Stage Fuzzy Fusion Based-Convolution Neural Network for Dynamic Emotion Recognition. In Studies in Computational Intelligence (pp. 91-114). (Studies in Computational Intelligence; Vol. 926). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-61577-2_7

@inbook{5d86e5ceaf534ab1bba070c691915b82,

title = "Two-Stage Fuzzy Fusion Based-Convolution Neural Network for Dynamic Emotion Recognition",

abstract = "The two-stage fuzzy fusion based-convolution neural network is proposed for dynamic emotion recognition by using both facial expression and speech modalities, which not only can extract discriminative emotion features which contain spatio-temporal information, but can also effectively fuse facial expression and speech modalities. Moreover, the proposal is able to handle situations where the contributions of each modality data to emotion recognition are very imbalanced. The local binary patterns coming from three orthogonal planes and spectrogram are considered first to extract low-level dynamic emotion, so that the spatio-temporal information of these modalities can be obtained. To reveal more discriminative features, two deep convolution neural networks are constructed to extract high-level emotion semantic features. Moreover, the two stage fuzzy fusion strategy is developed by integrating canonical correlation analysis and fuzzy broad learning system, so as to take into account the correlation and difference between different modal features, as well as handle the ambiguity of emotional state information.",

author = "Luefeng Chen and Min Wu and Witold Pedrycz and Kaoru Hirota",

note = "Publisher Copyright: {\textcopyright} 2020, The Author(s), under exclusive license to Springer Nature Switzerland AG.",

year = "2021",

doi = "10.1007/978-3-030-61577-2_7",

language = "English",

series = "Studies in Computational Intelligence",

volume = "926",

issn = "1860-949X",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "91--114",

booktitle = "Studies in Computational Intelligence",

address = "Germany",

}

TY - CHAP

T1 - Two-Stage Fuzzy Fusion Based-Convolution Neural Network for Dynamic Emotion Recognition

AU - Chen, Luefeng

AU - Wu, Min

AU - Pedrycz, Witold

AU - Hirota, Kaoru

PY - 2021

Y1 - 2021

AB - The two-stage fuzzy fusion based-convolution neural network is proposed for dynamic emotion recognition by using both facial expression and speech modalities, which not only can extract discriminative emotion features which contain spatio-temporal information, but can also effectively fuse facial expression and speech modalities. Moreover, the proposal is able to handle situations where the contributions of each modality data to emotion recognition are very imbalanced. The local binary patterns coming from three orthogonal planes and spectrogram are considered first to extract low-level dynamic emotion, so that the spatio-temporal information of these modalities can be obtained. To reveal more discriminative features, two deep convolution neural networks are constructed to extract high-level emotion semantic features. Moreover, the two stage fuzzy fusion strategy is developed by integrating canonical correlation analysis and fuzzy broad learning system, so as to take into account the correlation and difference between different modal features, as well as handle the ambiguity of emotional state information.

UR - http://www.scopus.com/inward/record.url?scp=85096205574&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-61577-2_7

DO - 10.1007/978-3-030-61577-2_7

M3 - Chapter

AN - SCOPUS:85096205574

T3 - Studies in Computational Intelligence

VL - 926

SP - 91

EP - 114

BT - Studies in Computational Intelligence

PB - Springer Science and Business Media Deutschland GmbH

ER -
