Non-intrusive speech quality assessment using deep belief network and backpropagation neural network

Yahui Shan, Jing Wang*, Xiang Xie, Liuchen Meng, Jingming Kuang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

10 引用 (Scopus)

摘要

In this paper, we present a new speech quality assessment method to estimate the quality of degraded speech without the reference speech. The traditional non-intrusive assessment methods cannot meet the requirement of high consistency with subjective results owing to the lack of original reference signals. To solve these issues, deep belief network is trained to produce pseudo-reference speech signal of degraded speech. Then mel-frequency cepstrum coefficients of pseudo-reference speech and degraded speech are extracted to calculate feature differences. The feature differences are mapped to speech quality score using backpropagation neural network. Experiments are conducted in a dataset containing various degraded speech signals and subjective listening scores. When compared with the standardization ITU-T P.563, Gaussian Mixture Model method and the autoencoder-based method, the proposed method brings about a higher correlation coefficient between predicted scores and subjective scores.

源语言英语
主期刊名2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
71-75
页数5
ISBN(电子版)9781538656273
DOI
出版状态已出版 - 2 7月 2018
活动11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Taipei, 中国台湾
期限: 26 11月 201829 11月 2018

出版系列

姓名2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

会议

会议11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018
国家/地区中国台湾
Taipei
时期26/11/1829/11/18

指纹

探究 'Non-intrusive speech quality assessment using deep belief network and backpropagation neural network' 的科研主题。它们共同构成独一无二的指纹。

引用此