Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression

Yuan Zong; Wenming Zheng; Tong Zhang; Xiaohua Huang

doi:10.1109/LSP.2016.2537926

Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression

Yuan Zong, Wenming Zheng^*, Tong Zhang, Xiaohua Huang

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

97 Citations (Scopus)

Abstract

In this letter, a novel cross-corpus speech emotion recognition (SER) method using domain-adaptive least-squares regression (DaLSR) model is proposed. In this method, an additional unlabeled data set from target speech corpus is used to serve as an auxiliary data set and combined with the labeled training data set from source speech corpus for jointly training the DaLSR model. In contrast to the traditional least-squares regression (LSR) method, the major novelty of DaLSR is that it is able to handle the mismatch problem between source and target speech corpora. Hence, the proposed DaLSR method is very suitable for coping with cross-corpus SER problem. For evaluating the performance of the proposed method in dealing with the cross-corpus SER problem, we conduct extensive experiments on three emotional speech corpora and compare the results with several state-of-the-art transfer learning methods that are widely used for cross-corpus SER problem. The experimental results show that the proposed method achieves better recognition accuracies than the state-of-the-art methods.

Original language	English
Article number	7425198
Pages (from-to)	585-589
Number of pages	5
Journal	IEEE Signal Processing Letters
Volume	23
Issue number	5
DOIs	https://doi.org/10.1109/LSP.2016.2537926
Publication status	Published - May 2016
Externally published	Yes

Keywords

Cross-corpus speech emotion recognition
Domain adaptation
Transfer learning

Access to Document

10.1109/LSP.2016.2537926

Cite this

Zong, Y., Zheng, W., Zhang, T., & Huang, X. (2016). Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression. IEEE Signal Processing Letters, 23(5), 585-589. Article 7425198. https://doi.org/10.1109/LSP.2016.2537926

@article{e06d28c3a6a64ec083e68c9bd64c9025,

title = "Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression",

abstract = "In this letter, a novel cross-corpus speech emotion recognition (SER) method using domain-adaptive least-squares regression (DaLSR) model is proposed. In this method, an additional unlabeled data set from target speech corpus is used to serve as an auxiliary data set and combined with the labeled training data set from source speech corpus for jointly training the DaLSR model. In contrast to the traditional least-squares regression (LSR) method, the major novelty of DaLSR is that it is able to handle the mismatch problem between source and target speech corpora. Hence, the proposed DaLSR method is very suitable for coping with cross-corpus SER problem. For evaluating the performance of the proposed method in dealing with the cross-corpus SER problem, we conduct extensive experiments on three emotional speech corpora and compare the results with several state-of-the-art transfer learning methods that are widely used for cross-corpus SER problem. The experimental results show that the proposed method achieves better recognition accuracies than the state-of-the-art methods.",

keywords = "Cross-corpus speech emotion recognition, Domain adaptation, Transfer learning",

author = "Yuan Zong and Wenming Zheng and Tong Zhang and Xiaohua Huang",

note = "Publisher Copyright: {\textcopyright} 1994-2012 IEEE.",

year = "2016",

month = may,

doi = "10.1109/LSP.2016.2537926",

language = "English",

volume = "23",

pages = "585--589",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "5",

}

TY - JOUR

T1 - Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression

AU - Zong, Yuan

AU - Zheng, Wenming

AU - Zhang, Tong

AU - Huang, Xiaohua

PY - 2016/5

Y1 - 2016/5

N2 - In this letter, a novel cross-corpus speech emotion recognition (SER) method using domain-adaptive least-squares regression (DaLSR) model is proposed. In this method, an additional unlabeled data set from target speech corpus is used to serve as an auxiliary data set and combined with the labeled training data set from source speech corpus for jointly training the DaLSR model. In contrast to the traditional least-squares regression (LSR) method, the major novelty of DaLSR is that it is able to handle the mismatch problem between source and target speech corpora. Hence, the proposed DaLSR method is very suitable for coping with cross-corpus SER problem. For evaluating the performance of the proposed method in dealing with the cross-corpus SER problem, we conduct extensive experiments on three emotional speech corpora and compare the results with several state-of-the-art transfer learning methods that are widely used for cross-corpus SER problem. The experimental results show that the proposed method achieves better recognition accuracies than the state-of-the-art methods.

AB - In this letter, a novel cross-corpus speech emotion recognition (SER) method using domain-adaptive least-squares regression (DaLSR) model is proposed. In this method, an additional unlabeled data set from target speech corpus is used to serve as an auxiliary data set and combined with the labeled training data set from source speech corpus for jointly training the DaLSR model. In contrast to the traditional least-squares regression (LSR) method, the major novelty of DaLSR is that it is able to handle the mismatch problem between source and target speech corpora. Hence, the proposed DaLSR method is very suitable for coping with cross-corpus SER problem. For evaluating the performance of the proposed method in dealing with the cross-corpus SER problem, we conduct extensive experiments on three emotional speech corpora and compare the results with several state-of-the-art transfer learning methods that are widely used for cross-corpus SER problem. The experimental results show that the proposed method achieves better recognition accuracies than the state-of-the-art methods.

KW - Cross-corpus speech emotion recognition

KW - Domain adaptation

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=84963795121&partnerID=8YFLogxK

U2 - 10.1109/LSP.2016.2537926

DO - 10.1109/LSP.2016.2537926

M3 - Article

AN - SCOPUS:84963795121

SN - 1070-9908

VL - 23

SP - 585

EP - 589

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

IS - 5

M1 - 7425198

ER -

Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this