ConvConcatNet: A Deep Convolutional Neural Network to Reconstruct Mel Spectrogram from the EEG

Xiran Xu, Bo Wang, Yujie Yan, Haolin Zhu, Zechen Zhang, Xihong Wu, Jing Chen*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

To investigate the processing of speech in the brain, simple linear models are commonly used to establish a relationship between brain signals and speech features. However, these linear models are illequipped to model a highly dynamic and complex non-linear system like the brain. Although non-linear methods with neural networks have been developed recently, reconstructing unseen stimuli from unseen subjects' EEG is still a highly challenging task. This work presents a novel method, ConvConcatNet, to reconstruct mel-spectrograms from EEG, in which the deep convolution neural network and extensive concatenation operation were combined. With our ConvConcatNet model, the Pearson correlation between the reconstructed and the target mel-spectrogram can achieve 0.0420, which was ranked as No.1 in the Task 2 of the Auditory EEG Challenge. The codes and models to implement our work will be available on Github: https://github.com/xuxiran/ConvConcatNet

源语言英语
主期刊名2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
113-114
页数2
ISBN(电子版)9798350374513
DOI
出版状态已出版 - 2024
已对外发布
活动49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Seoul, 韩国
期限: 14 4月 202419 4月 2024

出版系列

姓名2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024 - Proceedings

会议

会议49th IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops, ICASSPW 2024
国家/地区韩国
Seoul
时期14/04/2419/04/24

指纹

探究 'ConvConcatNet: A Deep Convolutional Neural Network to Reconstruct Mel Spectrogram from the EEG' 的科研主题。它们共同构成独一无二的指纹。

引用此