Abstract
Speech emotion recognition (SER) is of great importance in the human-robot interaction, and SER have been developed rapidly in recent years. How to identify high correlation features is a main question. In this paper, multiple random forest (MRF) is proposed, which can effectively predict up to several thousand explanatory variables, and improve the recognition result. At the mean time, personalized features and non-personalized features are fused for SER that can be divided into different subclasses by adopting the fuzzy C-means (FCM) clustering algorithm, then FCM-based MRF is proposed. Finally, our model is established, and the confusion matrix is output. Moreover, we conduct a separate classification of some certain emotion, which are difficult to recognition to some extent. And results show that the proposed method of FCM-based MRF achieves good performance for SER, which the recognition accuracy is 84.17%.
Original language | English |
---|---|
Publication status | Published - 2017 |
Externally published | Yes |
Event | 5th International Workshop on Advanced Computational Intelligence and Intelligent Informatics, IWACIII 2017 - Beijing, China Duration: 2 Nov 2017 → 5 Nov 2017 |
Conference
Conference | 5th International Workshop on Advanced Computational Intelligence and Intelligent Informatics, IWACIII 2017 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 2/11/17 → 5/11/17 |
Keywords
- Fuzzy C-means
- Non-personalized feature
- Random forest
- Speech emotion recognition