残差网络在婴幼儿哭声识别中的应用

Xiang Xie*, Liqiang Zhang, Jing Wang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

7 引用 (Scopus)

摘要

The deep learning model based on the residual network and the spectrogram is used to recognize infant crying. The corpus has balanced proportion of infant crying and non-crying samples. Finally, through the 5-fold cross validation, compared with three models of Support Vector Machine (SVM), Convolutional Neural Network (CNN) and the cochleagram residual network based on Gammatone filters (GT-Resnet), the spectrogram based residual network gets the best F1-score of 0.9965 and satisfies requirements of real time. It is proved that the spectrogram can react acoustics features intuitively and comprehensively in the recognition of infant crying. The residual network based on spectrogram is a good solution to infant crying recognition problem.

投稿的翻译标题Application of Residual Network to Infant Crying Recognition
源语言繁体中文
页(从-至)233-239
页数7
期刊Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology
41
1
DOI
出版状态已出版 - 1 1月 2019

关键词

  • Deep learning
  • Infant crying recognition
  • Residual network
  • Spectrogram

指纹

探究 '残差网络在婴幼儿哭声识别中的应用' 的科研主题。它们共同构成独一无二的指纹。

引用此