Non-Intrusive Audio Quality Assessment Based on Deep Neural Network for Subjective MOS Prediction

Xinwen Yue, Yupei Zhang, Jianqian Zhang, Zhiyu Li, Jing Wang, Shenghui Zhao

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Non-intrusive audio quality assessment, particularly for subjective MOS prediction for music signal, is crucial in realtime audio communication and playback systems. While network-based methods have been extensively used for objective speech quality assessment, evaluating audio quality presents a greater challenge due to higher sampling rates and more complex signal spectrum. In this paper, we design a non-intrusive audio quality assessment system based on deep neural network for subjective MOS prediction of distorted audio signals. Mixed perceptual features are extracted for signal analysis, and both objective and subjective indicators are utilized as labels for two-step training on simulated data. Besides, we apply improved convolution layers, attention layers, and a type of new loss to improve the performance of our model. The experimental results show that the proposed system performs better than conventional assessment methods in correlation.

Original languageEnglish
Title of host publication2024 14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024
EditorsYanmin Qian, Qin Jin, Zhijian Ou, Zhenhua Ling, Zhiyong Wu, Ya Li, Lei Xie, Jianhua Tao
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages76-80
Number of pages5
ISBN (Electronic)9798331516826
DOIs
Publication statusPublished - 2024
Event14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024 - Beijing, China
Duration: 7 Nov 202410 Nov 2024

Publication series

Name2024 14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024

Conference

Conference14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024
Country/TerritoryChina
CityBeijing
Period7/11/2410/11/24

Keywords

  • Audio quality assessment
  • Deep learning
  • MOS prediction
  • Non-intrusive

Fingerprint

Dive into the research topics of 'Non-Intrusive Audio Quality Assessment Based on Deep Neural Network for Subjective MOS Prediction'. Together they form a unique fingerprint.

Cite this