Deep learning for binaural sound source localization with low signal-to-noise ratio

Fengnian Zhao, Ruwei Li*, Dongmei Pan

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

1 Citation (Scopus)

Abstract

A novel deep learning (DL) method is proposed for binaural sound source localization with low SNR. Firstly, the binaural sound signals are decomposed into several channels by using Gammatone filter. Secondly, the 4 feature parameters of Head-related Transfer Function, interaural time difference (ITD), interaural coherence (IC), interaural level difference (ILD), and interaural phase difference (IPD) are extracted. Thirdly, ITD and IC go through a Deep Belief Network (DBN) to determine the quadrant of the sound source and reduce the positioning range. Then, ITD, IC, ILD, and IPD go through a Deep Neural Network (DNN) to obtain the azimuthal angle within 90 degrees. Experimental results show that the proposed algorithm can solve the front-back confusion, and obtain a superior performance with lower complexity and higher precision under low SNR conditions.

Original languageEnglish
Article number012017
JournalJournal of Physics: Conference Series
Volume1828
Issue number1
DOIs
Publication statusPublished - 4 Mar 2021
Externally publishedYes
Event2020 International Symposium on Automation, Information and Computing, ISAIC 2020 - Beijing, Virtual, China
Duration: 2 Dec 20204 Dec 2020

Fingerprint

Dive into the research topics of 'Deep learning for binaural sound source localization with low signal-to-noise ratio'. Together they form a unique fingerprint.

Cite this

Zhao, F., Li, R., & Pan, D. (2021). Deep learning for binaural sound source localization with low signal-to-noise ratio. Journal of Physics: Conference Series, 1828(1), Article 012017. https://doi.org/10.1088/1742-6596/1828/1/012017