Variable rate characteristic waveform interpolation speech coder based on phonetic classification

Jing Wang*, Jing Ming Kuang, Sheng Hui Zhao

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is proposed. Each input frame is classified into one of 4 phonetic classes. Non-speech frames are represented with Bark-band noise model. The extracted CWs become rapidly evolving waveforms (REWs) or slowly evolving waveforms (SEWs) in the cases of unvoiced or stationary voiced frames respectively, while mixed voiced frames use the same CW decomposition as that in the conventional CWI. Experimental results show that the proposed codec can eliminate most buzzy and noisy artifacts existing in the fixed-bit-rate characteristic waveform interpolation (FBR-CWI) speech codec, the average bit rate can be much lower, and its reconstructed speech quality is much better than FS 1016 CELP at 4.8 kbit/s and similar to G. 723.1 ACELP at 5.3 kbit/s.

源语言英语
页(从-至)187-192
页数6
期刊Journal of Beijing Institute of Technology (English Edition)
16
2
出版状态已出版 - 6月 2007

指纹

探究 'Variable rate characteristic waveform interpolation speech coder based on phonetic classification' 的科研主题。它们共同构成独一无二的指纹。

引用此