Chinese Dialect Speech Recognition Based on End-to-end Machine Learning

Fengrun Zhang, Xiang Xie, Xinyue Quan

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

With the development of End-to-end neural network, End-to-end speech recognition has achieved comparable performance with traditional speech recognition methods. The End-to-end speech recognition model only needs the speech features of the input and the text information of the output. This paper takes advantage of the End-to-end method and uses the dataset provided by the Oriental Language Recognition Challenge to build a Chinese dialect recognition system for Sichuanese, Hokkien, Shanghainese and Cantonese. Dialect data belongs to low-resource languages. In this paper, in view of the lack of dialect data resources, a method of adding unrelated languages for joint training and adding Chinese language model for joint decoding is proposed for dialect speech recognition. The model has a relative improvement of 12% in Character Error Rate compared with the Baseline systerm.

源语言英语
主期刊名Proceedings - 2022 International Conference on Machine Learning, Control, and Robotics, MLCR 2022
出版商Institute of Electrical and Electronics Engineers Inc.
14-18
页数5
ISBN(电子版)9781665454599
DOI
出版状态已出版 - 2022
活动2022 International Conference on Machine Learning, Control, and Robotics, MLCR 2022 - Suzhou, 中国
期限: 29 10月 202231 10月 2022

出版系列

姓名Proceedings - 2022 International Conference on Machine Learning, Control, and Robotics, MLCR 2022

会议

会议2022 International Conference on Machine Learning, Control, and Robotics, MLCR 2022
国家/地区中国
Suzhou
时期29/10/2231/10/22

指纹

探究 'Chinese Dialect Speech Recognition Based on End-to-end Machine Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此