TY - GEN
T1 - PFGM++ Combined with Stochastic Regeneration for Speech Enhancement
AU - Cao, Xiao
AU - Zhao, Shenghui
N1 - Publisher Copyright: © 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Diffusion models have been applied to speech enhancement because of their capability to learn complex data distributions. However, the extended Poisson flow generative model (PFGM++) outperforms diffusion models in terms of robustness. In this work, we introduce PFGM++ to speech enhancement and propose SR-PFGM++, which combines the stochastic regeneration model (StoRM) with PFGM++ and samples using an ordinary differential equation (ODE). Test results on the VoiceBank-DEMAND dataset show that SR-PFGM++ achieves higher performance than StoRM with fewer sampling steps. We also performed a mismatch test on the TIMIT+NOISE92 dataset, and the results show the strong generalization capability of SR-PFGM++.
AB - Diffusion models have been applied to speech enhancement because of their capability to learn complex data distributions. However, the extended Poisson flow generative model (PFGM++) outperforms diffusion models in terms of robustness. In this work, we introduce PFGM++ to speech enhancement and propose SR-PFGM++, which combines the stochastic regeneration model (StoRM) with PFGM++ and samples using an ordinary differential equation (ODE). Test results on the VoiceBank-DEMAND dataset show that SR-PFGM++ achieves higher performance than StoRM with fewer sampling steps. We also performed a mismatch test on the TIMIT+NOISE92 dataset, and the results show the strong generalization capability of SR-PFGM++.
KW - PFGM++
KW - score-based generative model
KW - speech enhancement
KW - stochastic regeneration
UR - http://www.scopus.com/inward/record.url?scp=85206098775&partnerID=8YFLogxK
U2 - 10.1109/ICSIP61881.2024.10671434
DO - 10.1109/ICSIP61881.2024.10671434
M3 - Conference contribution
AN - SCOPUS:85206098775
T3 - 2024 9th International Conference on Signal and Image Processing, ICSIP 2024
SP - 267
EP - 271
BT - 2024 9th International Conference on Signal and Image Processing, ICSIP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Signal and Image Processing, ICSIP 2024
Y2 - 12 July 2024 through 14 July 2024
ER -