TY - JOUR
T1 - An improved fully convolutional network based on post-processing with global variance equalization and noise-aware training for speech enhancement
AU - Li, Wenlong
AU - Hirota, Kaoru
AU - Dai, Yaping
AU - Jia, Zhiyang
N1 - Publisher Copyright:
© 2021 Fuji Technology Press. All rights reserved.
PY - 2021/1/20
Y1 - 2021/1/20
N2 - An improved fully convolutional network based on post-processing with global variance (GV) equalization and noise-aware training (PN-FCN) for speech enhancement model is proposed. It aims at reducing the complexity of the speech improvement system, and it solves overly smooth speech signal spectrogram problem and poor generalization capability. The PN-FCN is fed with the noisy speech samples augmented with an estimate of the noise. In this way, the PN-FCN uses additional online noise information to better predict the clean speech. Besides, PN-FCN uses the global variance information, which improve the subjective score in a voice conversion task. Finally, the proposed framework adopts FCN, and the number of parameters is one-seventh of deep neural network (DNN). Results of experiments on the Valentini-Botinhaos dataset demonstrate that the proposed framework achieves improvements in both denoising effect and model training speed.
AB - An improved fully convolutional network based on post-processing with global variance (GV) equalization and noise-aware training (PN-FCN) for speech enhancement model is proposed. It aims at reducing the complexity of the speech improvement system, and it solves overly smooth speech signal spectrogram problem and poor generalization capability. The PN-FCN is fed with the noisy speech samples augmented with an estimate of the noise. In this way, the PN-FCN uses additional online noise information to better predict the clean speech. Besides, PN-FCN uses the global variance information, which improve the subjective score in a voice conversion task. Finally, the proposed framework adopts FCN, and the number of parameters is one-seventh of deep neural network (DNN). Results of experiments on the Valentini-Botinhaos dataset demonstrate that the proposed framework achieves improvements in both denoising effect and model training speed.
KW - Fully convolutional network
KW - Noise-aware training
KW - Post-processing with global variance equalization
KW - Speech enhancement
UR - http://www.scopus.com/inward/record.url?scp=85100507691&partnerID=8YFLogxK
U2 - 10.20965/JACIII.2021.P0130
DO - 10.20965/JACIII.2021.P0130
M3 - Article
AN - SCOPUS:85100507691
SN - 1343-0130
VL - 25
SP - 130
EP - 137
JO - Journal of Advanced Computational Intelligence and Intelligent Informatics
JF - Journal of Advanced Computational Intelligence and Intelligent Informatics
IS - 1
ER -