Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression

Yuan Zong, Wenming Zheng*, Tong Zhang, Xiaohua Huang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

91 Citations (Scopus)

Abstract

In this letter, a novel cross-corpus speech emotion recognition (SER) method using domain-adaptive least-squares regression (DaLSR) model is proposed. In this method, an additional unlabeled data set from target speech corpus is used to serve as an auxiliary data set and combined with the labeled training data set from source speech corpus for jointly training the DaLSR model. In contrast to the traditional least-squares regression (LSR) method, the major novelty of DaLSR is that it is able to handle the mismatch problem between source and target speech corpora. Hence, the proposed DaLSR method is very suitable for coping with cross-corpus SER problem. For evaluating the performance of the proposed method in dealing with the cross-corpus SER problem, we conduct extensive experiments on three emotional speech corpora and compare the results with several state-of-the-art transfer learning methods that are widely used for cross-corpus SER problem. The experimental results show that the proposed method achieves better recognition accuracies than the state-of-the-art methods.

Original languageEnglish
Article number7425198
Pages (from-to)585-589
Number of pages5
JournalIEEE Signal Processing Letters
Volume23
Issue number5
DOIs
Publication statusPublished - May 2016
Externally publishedYes

Keywords

  • Cross-corpus speech emotion recognition
  • Domain adaptation
  • Transfer learning

Fingerprint

Dive into the research topics of 'Cross-Corpus Speech Emotion Recognition Based on Domain-Adaptive Least-Squares Regression'. Together they form a unique fingerprint.

Cite this