Noise-aware Speech Separation Based on Dual-Path RNN Model

  • Tongkun Xing*
  • , Ding Ding
  • , Nan Liu
  • , Xudong Zhou
  • , Fengming Liu
  • , Yu Wang
  • , Jing Zhang
  • , Guozheng Li
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Deep learning-based speech separation has made remarkable progress recently, especially in time-domain methods such as dual-path recurrent neural networks (DPRNN). Despite these advancements, existing methods encounter significant challenges when addressing highly complex acoustic environments or meeting increasingly stringent separation quality requirements. In this paper, we propose improvements to the DPRNN framework, including optimized convolutional parallelism, an attention module for feature enhancement, and an improved residual module. Evaluations on the self-synthesized dataset and the widely adopted LibriMix benchmark show that our improved DPRNN outperforms the original model in terms of signal-to-distortion ratio (SDRi) and scale-invariant signal-to-noise ratio (SI-SNRi) with limited increase in model size. Its practical applicability is further verified in challenging environments, which will help develop efficient and robust speech separation systems.

Original languageEnglish
Title of host publication2025 IEEE 3rd International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1071-1075
Number of pages5
ISBN (Electronic)9798331503567
DOIs
Publication statusPublished - 2025
Event3rd IEEE International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025 - Jinzhou, China
Duration: 29 Aug 202531 Aug 2025

Publication series

Name2025 IEEE 3rd International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025

Conference

Conference3rd IEEE International Conference on Sensors, Electronics and Computer Engineering, ICSECE 2025
Country/TerritoryChina
CityJinzhou
Period29/08/2531/08/25

Keywords

  • attention
  • convolution
  • recurrent neural networks
  • speech separation

Fingerprint

Dive into the research topics of 'Noise-aware Speech Separation Based on Dual-Path RNN Model'. Together they form a unique fingerprint.

Cite this