An improved non-intrusive objective speech quality evaluation based on FGMM and FNN

Jing Wang; Ying Zhang; Yuling Song; Shenghui Zhao; Jingming Kuang

doi:10.1109/CISP.2010.5646757

An improved non-intrusive objective speech quality evaluation based on FGMM and FNN

Jing Wang^*, Ying Zhang, Yuling Song, Shenghui Zhao, Jingming Kuang

^*此作品的通讯作者

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

1 引用（Scopus）

摘要

An improved non-intrusive objective speech quality evaluation method is proposed based on Fuzzy Gaussian Mixture Model (FGMM) and Fuzzy Neural Network (FNN). The degraded speech is separated into three classes (unvoiced, voiced and silence), then for each class the consistency measurement between Perceptual Linear Predictive (PLP) features of the degraded speech and the pre-trained FGMM reference model is calculated and mapped to an objective speech quality score using FNN mapping method. The proposed method performs better than the previous work using GMM and ITU-T P.563 under the test conditions used in this paper.

源语言	英语
主期刊名	Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010
页	3495-3499
页数	5
DOI	https://doi.org/10.1109/CISP.2010.5646757
出版状态	已出版 - 2010
活动	2010 3rd International Congress on Image and Signal Processing, CISP 2010 - Yantai, 中国期限: 16 10月 2010 → 18 10月 2010

出版系列

姓名	Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010
卷	7

会议

会议	2010 3rd International Congress on Image and Signal Processing, CISP 2010
国家/地区	中国
市	Yantai
时期	16/10/10 → 18/10/10

访问文件

10.1109/CISP.2010.5646757

其它文件与链接

链接到 Scopus 的出版物

引用此

Wang, J., Zhang, Y., Song, Y., Zhao, S., & Kuang, J. (2010). An improved non-intrusive objective speech quality evaluation based on FGMM and FNN. 在 Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010 (页码 3495-3499). 文章 5646757 (Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010; 卷 7). https://doi.org/10.1109/CISP.2010.5646757

@inproceedings{22584452bff64012af2b4dcf1a3f528d,

title = "An improved non-intrusive objective speech quality evaluation based on FGMM and FNN",

abstract = "An improved non-intrusive objective speech quality evaluation method is proposed based on Fuzzy Gaussian Mixture Model (FGMM) and Fuzzy Neural Network (FNN). The degraded speech is separated into three classes (unvoiced, voiced and silence), then for each class the consistency measurement between Perceptual Linear Predictive (PLP) features of the degraded speech and the pre-trained FGMM reference model is calculated and mapped to an objective speech quality score using FNN mapping method. The proposed method performs better than the previous work using GMM and ITU-T P.563 under the test conditions used in this paper.",

keywords = "Fuzzy Gaussian mixture model (FGMM), Fuzzy neural network (FNN), Non-intrusive, Objective evaluation, Speech quality",

author = "Jing Wang and Ying Zhang and Yuling Song and Shenghui Zhao and Jingming Kuang",

year = "2010",

doi = "10.1109/CISP.2010.5646757",

language = "English",

isbn = "9781424465149",

series = "Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010",

pages = "3495--3499",

booktitle = "Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010",

note = "2010 3rd International Congress on Image and Signal Processing, CISP 2010 ; Conference date: 16-10-2010 Through 18-10-2010",

}

Wang, J, Zhang, Y, Song, Y, Zhao, S & Kuang, J 2010, An improved non-intrusive objective speech quality evaluation based on FGMM and FNN. 在 Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010., 5646757, Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010, 卷 7, 页码 3495-3499, 2010 3rd International Congress on Image and Signal Processing, CISP 2010, Yantai, 中国, 16/10/10. https://doi.org/10.1109/CISP.2010.5646757

An improved non-intrusive objective speech quality evaluation based on FGMM and FNN. / Wang, Jing; Zhang, Ying; Song, Yuling 等.
Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010. 2010. 页码 3495-3499 5646757 (Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010; 卷 7).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - An improved non-intrusive objective speech quality evaluation based on FGMM and FNN

AU - Wang, Jing

AU - Zhang, Ying

AU - Song, Yuling

AU - Zhao, Shenghui

AU - Kuang, Jingming

PY - 2010

Y1 - 2010

N2 - An improved non-intrusive objective speech quality evaluation method is proposed based on Fuzzy Gaussian Mixture Model (FGMM) and Fuzzy Neural Network (FNN). The degraded speech is separated into three classes (unvoiced, voiced and silence), then for each class the consistency measurement between Perceptual Linear Predictive (PLP) features of the degraded speech and the pre-trained FGMM reference model is calculated and mapped to an objective speech quality score using FNN mapping method. The proposed method performs better than the previous work using GMM and ITU-T P.563 under the test conditions used in this paper.

AB - An improved non-intrusive objective speech quality evaluation method is proposed based on Fuzzy Gaussian Mixture Model (FGMM) and Fuzzy Neural Network (FNN). The degraded speech is separated into three classes (unvoiced, voiced and silence), then for each class the consistency measurement between Perceptual Linear Predictive (PLP) features of the degraded speech and the pre-trained FGMM reference model is calculated and mapped to an objective speech quality score using FNN mapping method. The proposed method performs better than the previous work using GMM and ITU-T P.563 under the test conditions used in this paper.

KW - Fuzzy Gaussian mixture model (FGMM)

KW - Fuzzy neural network (FNN)

KW - Non-intrusive

KW - Objective evaluation

KW - Speech quality

UR - http://www.scopus.com/inward/record.url?scp=78650575586&partnerID=8YFLogxK

U2 - 10.1109/CISP.2010.5646757

DO - 10.1109/CISP.2010.5646757

M3 - Conference contribution

AN - SCOPUS:78650575586

SN - 9781424465149

T3 - Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010

SP - 3495

EP - 3499

BT - Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010

T2 - 2010 3rd International Congress on Image and Signal Processing, CISP 2010

Y2 - 16 October 2010 through 18 October 2010

ER -

Wang J, Zhang Y, Song Y, Zhao S, Kuang J. An improved non-intrusive objective speech quality evaluation based on FGMM and FNN. 在 Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010. 2010. 页码 3495-3499. 5646757. (Proceedings - 2010 3rd International Congress on Image and Signal Processing, CISP 2010). doi: 10.1109/CISP.2010.5646757

An improved non-intrusive objective speech quality evaluation based on FGMM and FNN

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此