Non-intrusive speech quality assessment using deep belief network and backpropagation neural network

Yahui Shan, Jing Wang*, Xiang Xie, Liuchen Meng, Jingming Kuang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Citations (Scopus)

Abstract

In this paper, we present a new speech quality assessment method to estimate the quality of degraded speech without the reference speech. The traditional non-intrusive assessment methods cannot meet the requirement of high consistency with subjective results owing to the lack of original reference signals. To solve these issues, deep belief network is trained to produce pseudo-reference speech signal of degraded speech. Then mel-frequency cepstrum coefficients of pseudo-reference speech and degraded speech are extracted to calculate feature differences. The feature differences are mapped to speech quality score using backpropagation neural network. Experiments are conducted in a dataset containing various degraded speech signals and subjective listening scores. When compared with the standardization ITU-T P.563, Gaussian Mixture Model method and the autoencoder-based method, the proposed method brings about a higher correlation coefficient between predicted scores and subjective scores.

Original languageEnglish
Title of host publication2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages71-75
Number of pages5
ISBN (Electronic)9781538656273
DOIs
Publication statusPublished - 2 Jul 2018
Event11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Taipei, Taiwan, Province of China
Duration: 26 Nov 201829 Nov 2018

Publication series

Name2018 11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018 - Proceedings

Conference

Conference11th International Symposium on Chinese Spoken Language Processing, ISCSLP 2018
Country/TerritoryTaiwan, Province of China
CityTaipei
Period26/11/1829/11/18

Keywords

  • Backpropagation neural network
  • Deep belief network
  • Mel-frequency cepstrum coefficients
  • Non-intrusive speech quality assessment

Fingerprint

Dive into the research topics of 'Non-intrusive speech quality assessment using deep belief network and backpropagation neural network'. Together they form a unique fingerprint.

Cite this