An improved non-intrusive objective speech quality evaluation based on FGMM and FNN

Jing Wang*, Ying Zhang, Yuling Song, Shenghui Zhao, Jingming Kuang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

An improved non-intrusive objective speech quality evaluation method is proposed based on Fuzzy Gaussian Mixture Model (FGMM) and Fuzzy Neural Network (FNN). The degraded speech is separated into three classes (unvoiced, voiced and silence), then for each class the consistency measurement between Perceptual Linear Predictive (PLP) features of the degraded speech and the pre-trained FGMM reference model is calculated and mapped to an objective speech quality score using FNN mapping method. The proposed method performs better than the previous work using GMM and ITU-T P.563 under the test conditions used in this paper.

Original languageEnglish
Title of host publicationProceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010
Pages3495-3499
Number of pages5
DOIs
Publication statusPublished - 2010
Event2010 3rd International Congress on Image and Signal Processing, CISP 2010 - Yantai, China
Duration: 16 Oct 201018 Oct 2010

Publication series

NameProceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010
Volume7

Conference

Conference2010 3rd International Congress on Image and Signal Processing, CISP 2010
Country/TerritoryChina
CityYantai
Period16/10/1018/10/10

Keywords

  • Fuzzy Gaussian mixture model (FGMM)
  • Fuzzy neural network (FNN)
  • Non-intrusive
  • Objective evaluation
  • Speech quality

Fingerprint

Dive into the research topics of 'An improved non-intrusive objective speech quality evaluation based on FGMM and FNN'. Together they form a unique fingerprint.

Cite this