Output-based speech quality assessment using autoencoder and support vector regression

Jing Wang*, Yahui Shan, Xiang Xie, Jingming Kuang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

Output-based speech quality assessment has been widely used and has received increasing attention because it does not require an undistorted reference signal. To obtain a high correlation between predicted scores and subjective results, this paper presents a new method that estimates the quality of degraded speech without the reference speech. Bottleneck features are extracted with an autoencoder, and support vector regression is chosen as the mapping model from the objective representation to subjective scores. Experiments are conducted on a dataset containing various degraded speech signals and subjective listening scores. The proposed method takes advantage of the autoencoder's ability to form a good representation of its input, which can be better mapped to the Mean Opinion Score. The experimental results show that, compared with the ITU-T P.563 standard and another deep learning-based assessment method, the proposed method yields a higher correlation coefficient between predicted and subjective scores.
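The pipeline described in the abstract (autoencoder bottleneck features, then support vector regression onto Mean Opinion Scores, evaluated by correlation) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' setup: scikit-learn's `MLPRegressor` trained to reconstruct its input stands in for the autoencoder, the input features and MOS labels are synthetic, and the layer sizes, kernel, and the hypothetical helper names `bottleneck_features` and `run_sketch` are chosen only for illustration.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR


def bottleneck_features(autoencoder, X, bottleneck_index):
    """Run X through the encoder layers and return the bottleneck activations."""
    h = X
    weights = autoencoder.coefs_[: bottleneck_index + 1]
    biases = autoencoder.intercepts_[: bottleneck_index + 1]
    for W, b in zip(weights, biases):
        h = np.tanh(h @ W + b)  # matches activation="tanh" used below
    return h


def run_sketch(seed=0):
    rng = np.random.default_rng(seed)

    # Synthetic stand-in for per-utterance acoustic features and MOS labels
    # (assumption: the real dataset and features are not reproduced here).
    n, d = 300, 40
    latent = rng.normal(size=(n, 4))
    X = np.tanh(latent @ rng.normal(size=(4, d))) + 0.1 * rng.normal(size=(n, d))
    mos = np.clip(3.0 + latent[:, 0] + 0.2 * rng.normal(size=n), 1.0, 5.0)

    X_train, X_test = X[:240], X[240:]
    y_train, y_test = mos[:240], mos[240:]

    # Standardize using training statistics only.
    scaler = StandardScaler().fit(X_train)
    X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

    # Autoencoder: an MLP trained to reconstruct its own input.
    # Hidden layers 32 -> 8 -> 32; the 8-unit layer is the bottleneck.
    ae = MLPRegressor(hidden_layer_sizes=(32, 8, 32), activation="tanh",
                      max_iter=500, random_state=seed)
    ae.fit(X_train, X_train)

    Z_train = bottleneck_features(ae, X_train, bottleneck_index=1)
    Z_test = bottleneck_features(ae, X_test, bottleneck_index=1)

    # Support vector regression maps bottleneck features to MOS.
    svr = SVR(kernel="rbf", C=1.0, epsilon=0.1)
    svr.fit(Z_train, y_train)
    pred = svr.predict(Z_test)

    # Pearson correlation between predicted and subjective scores.
    corr = float(np.corrcoef(pred, y_test)[0, 1])
    return Z_test, pred, corr


if __name__ == "__main__":
    _, _, corr = run_sketch()
    print(f"Pearson correlation on held-out synthetic data: {corr:.3f}")
```

The correlation printed here reflects only the synthetic data and says nothing about the results reported in the paper.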

Original language: English
Pages (from-to): 13-20
Number of pages: 8
Journal: Speech Communication
Volume: 110
Publication status: Published - Jul 2019
