An End-to-End Binaural Sound Localization Model Based on the Equalization and Cancellation Theory

Tao Song, Wenwen Zhang, Jing Chen*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

The end-to-end framework has been introduced into the binaural localization modeling and achieved higher localization accuracy than the other models, however, the reasonability and interpretability for applying the related neural networks remain unclear. It has been well documented that the auditory system relies on binaural cues for sound localization, and the equalization and cancellation (EC) theory describes how the binaural cues are extracted. In this paper, an end-to-end binaural localization model is proposed based on the EC theory. In the proposed model, a convolution neural network(CNN) with a specifically designed activation function is used to implement the EC theory. The proposed model was trained in synthesized rooms and evaluated in real rooms. Experiment results show that CNN kernels learned by the proposed model are corresponding to binaural cues, and the proposed model outperforms the current end-to-end model by a 10.73% improvement in localization accuracy and a 12.91% improvement in root mean square error(RMSE).

Original languageEnglish
Title of host publicationAES Europe Spring 2022 - 152nd Audio Engineering Society Convention 2022
PublisherAudio Engineering Society
Pages275-283
Number of pages9
ISBN (Electronic)9781713855415
Publication statusPublished - 2022
Externally publishedYes
EventAES Europe Spring 2022 - 152nd Audio Engineering Society Convention 2022 - The Hague, Virtual, Netherlands
Duration: 16 May 202219 May 2022

Publication series

NameAES Europe Spring 2022 - 152nd Audio Engineering Society Convention 2022

Conference

ConferenceAES Europe Spring 2022 - 152nd Audio Engineering Society Convention 2022
Country/TerritoryNetherlands
CityThe Hague, Virtual
Period16/05/2219/05/22

Fingerprint

Dive into the research topics of 'An End-to-End Binaural Sound Localization Model Based on the Equalization and Cancellation Theory'. Together they form a unique fingerprint.

Cite this