Abstract
A Bidirectional Long Short-Term Memory (BLSTM) network is applied to improve the accuracy of Chinese speech emotion recognition for six basic human emotions (angry, fear, happy, neutral, sad, and surprise). The BLSTM network learns and stores emotional features in memory blocks, a special architecture that retains information across a long sentence. Because sentence context matters, the network also supplies each frame with information from both its past and its future. Experiments on the CASIA Chinese emotion corpus show an average recognition accuracy of 73.83%, an increase of 9.83% over the method based on information cell, 7.83% over the method based on Mel-Frequency Cepstral Coefficients and Principal Component Analysis, and 24.83% over Random Deep Belief Networks.
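The bidirectional idea described in the abstract can be illustrated with a minimal sketch in plain Python. It uses the standard LSTM gate equations and is not the authors' configuration: the feature size, hidden size, random weights, and all function names (`init_lstm`, `lstm_step`, `run_lstm`, `bidirectional_lstm`) are assumptions made for illustration, and the classifier layer and training are omitted.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def init_lstm(input_size, hidden_size, rng):
    """Random weights for one direction (hypothetical initialisation)."""
    def matrix(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    # One (weights, bias) pair per gate: input i, forget f, output o, candidate g.
    return {gate: (matrix(hidden_size, input_size + hidden_size), [0.0] * hidden_size)
            for gate in "ifog"}


def lstm_step(params, x, h, c):
    """One time step: update cell state c (the memory) and hidden state h."""
    z = x + h  # list concatenation of the current frame and the previous hidden state

    def affine(gate):
        weights, bias = params[gate]
        return [sum(w * v for w, v in zip(row, z)) + b for row, b in zip(weights, bias)]

    i = [sigmoid(a) for a in affine("i")]
    f = [sigmoid(a) for a in affine("f")]
    o = [sigmoid(a) for a in affine("o")]
    g = [math.tanh(a) for a in affine("g")]
    c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
    h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
    return h_new, c_new


def run_lstm(params, frames, hidden_size):
    """Run one direction over the frames and return the hidden state at each step."""
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    outputs = []
    for x in frames:
        h, c = lstm_step(params, x, h, c)
        outputs.append(h)
    return outputs


def bidirectional_lstm(fwd, bwd, frames, hidden_size):
    """Concatenate, per frame, a past-to-future pass and a future-to-past pass."""
    forward = run_lstm(fwd, frames, hidden_size)
    # Process the reversed sequence, then flip back so index t matches frame t.
    backward = run_lstm(bwd, frames[::-1], hidden_size)[::-1]
    return [f + b for f, b in zip(forward, backward)]


# Toy input: 5 frames of 3 made-up acoustic features each (not real speech data).
rng = random.Random(0)
input_size, hidden_size = 3, 4
frames = [[rng.uniform(-1.0, 1.0) for _ in range(input_size)] for _ in range(5)]
fwd = init_lstm(input_size, hidden_size, rng)
bwd = init_lstm(input_size, hidden_size, rng)

out = bidirectional_lstm(fwd, bwd, frames, hidden_size)
print(len(out), len(out[0]))  # 5 8
```

In this sketch the first half of each output vector depends only on the current and earlier frames, while the second half depends on the current and later frames. Changing the last frame therefore alters the backward half of the first output but leaves its forward half untouched, which is the sense in which each frame sees both history and future.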
Original language | English |
---|---|
Publication status | Published - 2017 |
Event | 5th International Workshop on Advanced Computational Intelligence and Intelligent Informatics, IWACIII 2017 - Beijing, China; Duration: 2 Nov 2017 → 5 Nov 2017 |
Conference
Conference | 5th International Workshop on Advanced Computational Intelligence and Intelligent Informatics, IWACIII 2017 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 2 Nov 2017 → 5 Nov 2017 |
Keywords
- Bidirectional long short-term memory
- CASIA Chinese corpus
- Chinese speech emotion recognition