TY - JOUR
T1 - Diffuseness Estimation-Based SSTP Detection for Multiple Sound Source Localization in Reverberant Environments
AU - Zhang, Yu
AU - Jia, Maoshen
AU - Gao, Shang
AU - Wang, Jing
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2023/8
Y1 - 2023/8
N2 - This paper proposes a diffuseness estimation-based single-source time–frequency point (SSTP) detection method for multisource direction of arrival (DOA) estimation. According to the composition, time–frequency (TF) points are divided into three types: SSTP, multisource TF, and interference TF. SSTPs and multisource TF points are defined as weak interference time–frequency points (WITPs). An SSTP is a TF point consisting only of the direct component of one sound source, which is beneficial for DOA estimation. Therefore, multisource DOA estimation is transformed into single-source DOA estimation by SSTP detection. Diffuseness estimation is introduced for a sound field microphone array. WITPs are detected by a diffuseness estimation–based detection method. Phase similarity determination is adopted to identify SSTPs from detected WITPs. Multiple sound source localization is completed by searching peaks in the normalized histogram of DOA estimates corresponding to the detected SSTPs. Experiments demonstrate that the proposed method achieves the precise detection of SSTPs, and evaluations show that it has superior accuracy of multiple sound source counting and localization in reverberant and noisy environments.
AB - This paper proposes a diffuseness estimation-based single-source time–frequency point (SSTP) detection method for multisource direction of arrival (DOA) estimation. According to the composition, time–frequency (TF) points are divided into three types: SSTP, multisource TF, and interference TF. SSTPs and multisource TF points are defined as weak interference time–frequency points (WITPs). An SSTP is a TF point consisting only of the direct component of one sound source, which is beneficial for DOA estimation. Therefore, multisource DOA estimation is transformed into single-source DOA estimation by SSTP detection. Diffuseness estimation is introduced for a sound field microphone array. WITPs are detected by a diffuseness estimation–based detection method. Phase similarity determination is adopted to identify SSTPs from detected WITPs. Multiple sound source localization is completed by searching peaks in the normalized histogram of DOA estimates corresponding to the detected SSTPs. Experiments demonstrate that the proposed method achieves the precise detection of SSTPs, and evaluations show that it has superior accuracy of multiple sound source counting and localization in reverberant and noisy environments.
KW - Diffuseness estimation
KW - Direction of arrival
KW - Reverberation
KW - Sparsity component analysis
UR - http://www.scopus.com/inward/record.url?scp=85149759341&partnerID=8YFLogxK
U2 - 10.1007/s00034-023-02329-y
DO - 10.1007/s00034-023-02329-y
M3 - Article
AN - SCOPUS:85149759341
SN - 0278-081X
VL - 42
SP - 4713
EP - 4739
JO - Circuits, Systems, and Signal Processing
JF - Circuits, Systems, and Signal Processing
IS - 8
ER -