Residual Unet with Attention Mechanism for Time-Frequency Domain Speech Enhancement

Hanyu Chen, Xiwei Peng, Qiqi Jiang, Yujie Guo

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Eliminating the negative effects of background environmental noise is an interesting and challenging task in audio processing. In recent years, denoising technology based on neural networks (NN) has achieved good performance. In particular, the structure based on the convolutional encoder and decoder has been proven to achieve good enhancement effects. On this basis, this paper proposes a residual unet structure combined with the attention mechanism. Effectively reduce the impact of gradient disappearance on network training, and improve the semantic gap between encoder output and decoder output due to unet shortcut connections. The experimental results show that compared with the DNN baseline and unet network, the enhanced voice quality has been significantly improved.

源语言英语
主期刊名Proceedings of the 41st Chinese Control Conference, CCC 2022
编辑Zhijun Li, Jian Sun
出版商IEEE Computer Society
7007-7011
页数5
ISBN(电子版)9789887581536
DOI
出版状态已出版 - 2022
活动41st Chinese Control Conference, CCC 2022 - Hefei, 中国
期限: 25 7月 202227 7月 2022

出版系列

姓名Chinese Control Conference, CCC
2022-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议41st Chinese Control Conference, CCC 2022
国家/地区中国
Hefei
时期25/07/2227/07/22

指纹

探究 'Residual Unet with Attention Mechanism for Time-Frequency Domain Speech Enhancement' 的科研主题。它们共同构成独一无二的指纹。

引用此

Chen, H., Peng, X., Jiang, Q., & Guo, Y. (2022). Residual Unet with Attention Mechanism for Time-Frequency Domain Speech Enhancement. 在 Z. Li, & J. Sun (编辑), Proceedings of the 41st Chinese Control Conference, CCC 2022 (页码 7007-7011). (Chinese Control Conference, CCC; 卷 2022-July). IEEE Computer Society. https://doi.org/10.23919/CCC55666.2022.9902215