Encoding and Decoding of Chinese Phonemes Based on MEG Signals

Jinghua Liang, Bo Wang, Xihong Wu, Jing Chen

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

Phonemes, the smallest phonetic units of speech sound, link neural recordings to words. Decoding phonemes from non-invasive electro-/magnetoencephalography (EEG/MEG) has been shown to be feasible. However, previous studies mainly decoded EEG/MEG responses evoked by isolated phoneme or syllable stimuli, a setting that does not match the continuous speech encountered in practical applications. In this study, we investigated the neural encoding and decoding of Mandarin Chinese phonemes in continuous speech using MEG signals. First, a Vector Quantized Variational Autoencoder (VQ-VAE) speech model was trained to extract phoneme features from continuous speech. Next, representational similarity analysis (RSA) was performed on these features and their corresponding MEG data to explore the temporal pattern of phoneme representation in MEG. Finally, neural networks were used to reconstruct phoneme features from MEG signals, and decoding performance was evaluated at both the segment and frame levels. The RSA results show that Chinese phonemes are significantly represented in MEG signals from 160 ms to 300 ms after phoneme onset. At the segment level (3 seconds), a retrieval task using phoneme features decoded from MEG signals reached a top-10 accuracy of 41.57% among 987 candidates. At the frame level, six-class classification of phoneme articulation manner reached an accuracy of 36.16%.
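
To make the segment-level retrieval metric concrete, the following is a minimal sketch of how a top-k retrieval accuracy could be computed. It is not the authors' implementation: the cosine similarity measure, the flattening of each segment's features into a single vector, and the names `cosine_similarity`, `top_k_accuracy`, and `chance_top_k` are all assumptions made for illustration.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_accuracy(decoded: List[Vector], candidates: List[Vector], k: int) -> float:
    """Fraction of segments whose true candidate ranks within the top k.

    Assumption: decoded[i] is the feature vector decoded from the MEG data of
    segment i, and candidates[i] is the true speech feature of that segment.
    """
    if not decoded or len(decoded) != len(candidates):
        raise ValueError("decoded and candidates must be non-empty and equally long")
    hits = 0
    for i, query in enumerate(decoded):
        sims = [cosine_similarity(query, c) for c in candidates]
        # Rank of the true candidate = number of candidates that score strictly higher.
        rank = sum(1 for s in sims if s > sims[i])
        if rank < k:
            hits += 1
    return hits / len(decoded)


def chance_top_k(k: int, n_candidates: int) -> float:
    """Top-k accuracy expected from a uniformly random ranking."""
    return min(k, n_candidates) / n_candidates


if __name__ == "__main__":
    # Toy data (hypothetical): three segments with three-dimensional features.
    true_features = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    decoded_features = [[0.9, 0.1, 0.0], [0.2, 0.8, 0.1], [0.9, 0.0, 0.3]]
    print(top_k_accuracy(decoded_features, true_features, k=1))
    print(top_k_accuracy(decoded_features, true_features, k=2))
    print(chance_top_k(10, 987))
```

Under a uniformly random ranking, the top-10 accuracy with 987 candidates would be 10/987, or about 1.01%, which gives a reference point for the reported 41.57%. Similarly, if the six articulation-manner classes were balanced (an assumption; the abstract does not state the class distribution), chance for the frame-level task would be about 16.67%.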

Original language: English
Title of host publication: 2024 14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024
Editors: Yanmin Qian, Qin Jin, Zhijian Ou, Zhenhua Ling, Zhiyong Wu, Ya Li, Lei Xie, Jianhua Tao
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 224-228
Number of pages: 5
ISBN (Electronic): 9798331516826
Publication status: Published - 2024
Externally published: Yes
Event: 14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024 - Beijing, China
Duration: 7 Nov 2024 to 10 Nov 2024

Publication series

Name: 2024 14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024

Conference

Conference: 14th International Symposium on Chinese Spoken Language Processing, ISCSLP 2024
Country/Territory: China
City: Beijing
Period: 7/11/24 to 10/11/24

Keywords

  • Chinese phonemes
  • RSA
  • VQ-VAE
  • non-invasive neural signals
  • phoneme decoding
