Multi-layer hybrid fuzzy classification based on svm and improved pso for speech emotion recognition

Shihan Huang, Hua Dang, Rongkun Jiang, Yue Hao, Chengbo Xue, Wei Gu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

15 Citations (Scopus)

Abstract

Speech Emotion Recognition (SER) plays a significant role in the field of Human–Computer Interaction (HCI) with a wide range of applications. However, there are still some issues in practical application. One of the issues is the difference between emotional expression amongst various individuals, and another is that some indistinguishable emotions may reduce the stability of the SER system. In this paper, we propose a multi-layer hybrid fuzzy support vector machine (MLHF-SVM) model, which includes three layers: feature extraction layer, pre-classification layer, and classification layer. The MLHF-SVM model solves the above-mentioned issues by fuzzy c-means (FCM) based on identification information of human and multi-layer SVM classifiers, respectively. In addition, to overcome the weakness that FCM tends to fall into local minima, an improved natural exponential inertia weight particle swarm optimization (IEPSO) algorithm is proposed and integrated with fuzzy c-means for optimization. Moreover, in the feature extraction layer, non-personalized features and personalized features are combined to improve accuracy. In order to verify the effectiveness of the proposed model, all emotions in three popular datasets are used for simulation. The results show that this model can effectively improve the success rate of classification and the maximum value of a single emotion recognition rate is 97.67% on the EmoDB dataset.

Original languageEnglish
Article number2891
JournalElectronics (Switzerland)
Volume10
Issue number23
DOIs
Publication statusPublished - Dec 2021

Keywords

  • Fuzzy c-means
  • Particle swarm optimization
  • Speech emotion recognition
  • Support vector machines

Fingerprint

Dive into the research topics of 'Multi-layer hybrid fuzzy classification based on svm and improved pso for speech emotion recognition'. Together they form a unique fingerprint.

Cite this