TY - JOUR
T1 - Auditory Attention Decoding in Four-Talker Environment with EEG
AU - Yan, Yujie
AU - Xu, Xiran
AU - Zhu, Haolin
AU - Tian, Pei
AU - Ge, Zhongshu
AU - Wu, Xihong
AU - Chen, Jing
N1 - Publisher Copyright: © 2024 International Speech Communication Association. All rights reserved.
PY - 2024
Y1 - 2024
N2 - Auditory Attention Decoding (AAD) is a technique that determines the focus of a listener's attention in complex auditory scenes according to cortical neural responses. Existing research largely examines two-talker scenarios, which are insufficient to capture real-world complexity. This study introduced a new AAD database for a four-talker scenario in which speech from four distinct, spatially separated talkers was presented simultaneously while listeners' EEG was recorded. Temporal response function (TRF) analysis showed that TRFs for attended speech were stronger than those for each unattended speech stream. AAD methods based on stimulus reconstruction (SR) and cortical spatial lateralization were employed and compared. Results indicated a decoding accuracy of 77.5% in 60 s (chance level of 25%) using SR. Auditory spatial attention detection (ASAD) methods also achieved high accuracy (94.7% with DenseNet-3D in 1 s), demonstrating the generalization performance of ASAD methods.
AB - Auditory Attention Decoding (AAD) is a technique that determines the focus of a listener's attention in complex auditory scenes according to cortical neural responses. Existing research largely examines two-talker scenarios, which are insufficient to capture real-world complexity. This study introduced a new AAD database for a four-talker scenario in which speech from four distinct, spatially separated talkers was presented simultaneously while listeners' EEG was recorded. Temporal response function (TRF) analysis showed that TRFs for attended speech were stronger than those for each unattended speech stream. AAD methods based on stimulus reconstruction (SR) and cortical spatial lateralization were employed and compared. Results indicated a decoding accuracy of 77.5% in 60 s (chance level of 25%) using SR. Auditory spatial attention detection (ASAD) methods also achieved high accuracy (94.7% with DenseNet-3D in 1 s), demonstrating the generalization performance of ASAD methods.
KW - auditory attention decoding
KW - auditory spatial attention detection
KW - DenseNet
KW - EEG
KW - stimulus reconstruction
KW - temporal response functions
UR - http://www.scopus.com/inward/record.url?scp=85205731082&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2024-739
DO - 10.21437/Interspeech.2024-739
M3 - Conference article
AN - SCOPUS:85205731082
SN - 2308-457X
SP - 432
EP - 436
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 25th Interspeech Conference 2024
Y2 - 1 September 2024 through 5 September 2024
ER -