残差网络在婴幼儿哭声识别中的应用

Translated title of the contribution: Application of Residual Network to Infant Crying Recognition

Xiang Xie*, Liqiang Zhang, Jing Wang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

The deep learning model based on the residual network and the spectrogram is used to recognize infant crying. The corpus has balanced proportion of infant crying and non-crying samples. Finally, through the 5-fold cross validation, compared with three models of Support Vector Machine (SVM), Convolutional Neural Network (CNN) and the cochleagram residual network based on Gammatone filters (GT-Resnet), the spectrogram based residual network gets the best F1-score of 0.9965 and satisfies requirements of real time. It is proved that the spectrogram can react acoustics features intuitively and comprehensively in the recognition of infant crying. The residual network based on spectrogram is a good solution to infant crying recognition problem.

Translated title of the contributionApplication of Residual Network to Infant Crying Recognition
Original languageChinese (Traditional)
Pages (from-to)233-239
Number of pages7
JournalDianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology
Volume41
Issue number1
DOIs
Publication statusPublished - 1 Jan 2019

Fingerprint

Dive into the research topics of 'Application of Residual Network to Infant Crying Recognition'. Together they form a unique fingerprint.

Cite this