跳到主要导航 跳到搜索 跳到主要内容

A Chinese Speech Recognition System Based on Binary Neural Network and Pre-processing

  • Lunyi Guo*
  • , Yijie Deng
  • , Liang Tang
  • , Ronggeng Fan
  • , Bo Yan
  • , Zhuoling Xiao
  • *此作品的通讯作者
  • University of Electronic Science and Technology of China

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Neural networks have made excellent progress in the field of speech recognition. However, more research needs to be done in some scenarios where computational resources are limited or real-Time, and low power consumption is required. In this paper, we propose a lightweight speech recognition model based on pre-processing + binary neural network, which can significantly reduce the number of weight parameters while ensuring an acceptable error rate. The speech pre-processing part converts the 1D speech signal to the 2D Mel spectrum and uses Voice Activate Detection (VAD) to make the speech Mel spectrum input variable. The speech data set is also expanded using data augmentation methods. For convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. The number of model parameters after quantization is 6.94% of the number of full precision model parameters, and the error rate on the ST CMD speech dataset increases by only 2.07%. Finally, a circuit structure based on binary weights for convolutional computation is designed, and a single multiplication can be implemented using only the hardware resources of the 7 Look Up Table (LUT).

源语言英语
主期刊名2023 6th World Conference on Computing and Communication Technologies, WCCCT 2023
出版商Institute of Electrical and Electronics Engineers Inc.
129-134
页数6
ISBN(电子版)9781665461467
DOI
出版状态已出版 - 2023
已对外发布
活动6th World Conference on Computing and Communication Technologies, WCCCT 2023 - Virtual, Online, 中国
期限: 6 1月 20238 1月 2023

出版系列

姓名2023 6th World Conference on Computing and Communication Technologies, WCCCT 2023

会议

会议6th World Conference on Computing and Communication Technologies, WCCCT 2023
国家/地区中国
Virtual, Online
时期6/01/238/01/23

指纹

探究 'A Chinese Speech Recognition System Based on Binary Neural Network and Pre-processing' 的科研主题。它们共同构成独一无二的指纹。

引用此