A Chinese Speech Recognition System Based on Binary Neural Network and Pre-processing

Lunyi Guo*, Yijie Deng, Liang Tang, Ronggeng Fan, Bo Yan, Zhuoling Xiao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

Neural networks have made excellent progress in the field of speech recognition. However, more research is needed for scenarios where computational resources are limited or where real-time operation and low power consumption are required. In this paper, we propose a lightweight speech recognition model that combines pre-processing with a binary neural network, which significantly reduces the number of weight parameters while keeping the error rate acceptable. The speech pre-processing stage converts the 1D speech signal to a 2D Mel spectrum and uses Voice Activity Detection (VAD) so that the Mel-spectrum input can vary in length. The speech dataset is also expanded using data augmentation methods. In the convolutional layers, the weights are binarized to reduce the number of model parameters and improve computational and storage efficiency. After quantization, the number of model parameters is 6.94% of that of the full-precision model, and the error rate on the ST CMD speech dataset increases by only 2.07%. Finally, a circuit structure for convolution with binary weights is designed, in which a single multiplication can be implemented using the hardware resources of only 7 look-up tables (LUTs).
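The weight binarization mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state which binarization scheme the paper uses, so the sign function with a single mean-absolute-value scale below is an assumption borrowed from common binary-weight approaches, and the function names `binarize_weights` and `binary_dot` are hypothetical. The sketch only shows why binary weights remove the need for a general multiplier: each product reduces to adding or subtracting the input.

```python
from typing import List, Tuple


def binarize_weights(weights: List[float]) -> Tuple[List[int], float]:
    """Map full-precision weights to {-1, +1} plus one shared scale.

    Assumption: the scale is the mean absolute weight. The abstract does
    not specify the scaling actually used in the paper.
    """
    signs = [1 if w >= 0 else -1 for w in weights]
    scale = sum(abs(w) for w in weights) / len(weights)
    return signs, scale


def binary_dot(signs: List[int], scale: float, inputs: List[float]) -> float:
    """Dot product with binary weights.

    Each per-element "multiplication" is a sign select: the input is
    either added or subtracted. One scaling is applied at the end.
    """
    acc = 0.0
    for s, x in zip(signs, inputs):
        acc += x if s > 0 else -x
    return scale * acc


if __name__ == "__main__":
    signs, scale = binarize_weights([0.5, -0.25, 0.75, -1.0])
    print(signs, scale)
    print(binary_dot(signs, scale, [1.0, 2.0, 3.0, 4.0]))
```

Storing one sign bit per weight instead of a full-precision value is what shrinks the binarized layers. The overall figure of 6.94% reported above comes from the paper and is not derived by this sketch.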

Original language: English
Title of host publication: 2023 6th World Conference on Computing and Communication Technologies, WCCCT 2023
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 129-134
Number of pages: 6
ISBN (Electronic): 9781665461467
Publication status: Published - 2023
Externally published: Yes
Event: 6th World Conference on Computing and Communication Technologies, WCCCT 2023 - Virtual, Online, China
Duration: 6 Jan 2023 to 8 Jan 2023

Publication series

Name: 2023 6th World Conference on Computing and Communication Technologies, WCCCT 2023

Conference

Conference: 6th World Conference on Computing and Communication Technologies, WCCCT 2023
Country/Territory: China
City: Virtual, Online
Period: 6/01/23 to 8/01/23

Keywords

  • edge compute
  • binary weights neural network
  • speech recognize
  • voice activate detection
