Recurrent neural network for spectral mapping in speech bandwidth extension

Yingxue Wang, Shenghui Zhao, Jianxin Li, Jingming Kuang, Qiang Zhu

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)

摘要

We present a recurrent neural network (RNN) based speech bandwidth extension (BWE) method. The conventional Gaussian mixture model (GMM) based BWE methods perform stably and effectively. However, GMM based methods suffer from two fundamental and competing problems: 1) inadequacy of GMM in modeling the non-linear relationship between the low frequency (LF) and high frequency (HF), 2) temporal correlations across speech frames are ignored, resulting in spectral detail loss of the reconstructed speech by BWE. To cope these problems, a RNN is employed to capture temporal information and construct deep non-linear relationships between the spectral envelope features of LF and HF. The proposed RNN is trained layer-by-layer from a cascade of two recurrent temporal restricted Boltzmann machines (RTRBMs) and a feedforward neural network (NN). The proposed method takes advantage of the strong ability of RTRBMs in discovering the temporal correlation between adjacent frames and modeling deep non-linear relationships between input and output. Both the objective and subjective evaluations indicate that our proposed method outperforms the conventional GMM based methods and other NN based methods.

源语言英语
主期刊名2016 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2016 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
242-246
页数5
ISBN(电子版)9781509045457
DOI
出版状态已出版 - 19 4月 2017
活动2016 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2016 - Washington, 美国
期限: 7 12月 20169 12月 2016

出版系列

姓名2016 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2016 - Proceedings
2017-April

会议

会议2016 IEEE Global Conference on Signal and Information Processing, GlobalSIP 2016
国家/地区美国
Washington
时期7/12/169/12/16

指纹

探究 'Recurrent neural network for spectral mapping in speech bandwidth extension' 的科研主题。它们共同构成独一无二的指纹。

引用此