Explainable Stuttering Recognition Using Axial Attention

Yu Ma, Yuting Huang, Kaixiang Yuan, Guangzhe Xuan, Yongzi Yu, Hengrui Zhong, Rui Li, Jian Shen*, Kun Qian, Bin Hu, Björn W. Schuller, Yoshiharu Yamamoto

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 1
  • Captures
    • Readers: 1
see details

摘要

Stuttering is a complex speech disorder that disrupts the flow of speech, and recognizing persons who stutter (PWS) and understanding their significant struggles is crucial. With advancements in computer vision, deep neural networks offer potential for recognizing stuttering events through image-based features. In this paper, we extract image features of Wavelet Transformation (WT) and Histograms of Oriented Gradient (HOG) from audio signals. We also generate explainable images using Gradient-weighted Class Activation Mapping (Grad-CAM) as input for our final recognition model–an axial attention-based EfficientNetV2, which is trained on the Kassel State of Fluency Dataset (KSoF) to perform 8 classes recognition. Our experimental results achieved a relative percentage increase in unweighted average recall (UAR) of 4.4% compared to the baseline of ComParE 2022, demonstrating that the axial attention-based EfficientNetV2, combined with the explainable input, has the capability to detect and recognise multiple types of stuttering.

源语言英语
主期刊名Advanced Intelligent Computing Technology and Applications - 19th International Conference, ICIC 2023, Proceedings
编辑De-Shuang Huang, Prashan Premaratne, Baohua Jin, Boyang Qu, Kang-Hyun Jo, Abir Hussain
出版商Springer Science and Business Media Deutschland GmbH
209-220
页数12
ISBN(印刷版)9789819947485
DOI
出版状态已出版 - 2023
活动19th International Conference on Intelligent Computing, ICIC 2023 - Zhengzhou, 中国
期限: 10 8月 202313 8月 2023

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14088 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议19th International Conference on Intelligent Computing, ICIC 2023
国家/地区中国
Zhengzhou
时期10/08/2313/08/23

指纹

探究 'Explainable Stuttering Recognition Using Axial Attention' 的科研主题。它们共同构成独一无二的指纹。

引用此

Ma, Y., Huang, Y., Yuan, K., Xuan, G., Yu, Y., Zhong, H., Li, R., Shen, J., Qian, K., Hu, B., Schuller, B. W., & Yamamoto, Y. (2023). Explainable Stuttering Recognition Using Axial Attention. 在 D.-S. Huang, P. Premaratne, B. Jin, B. Qu, K.-H. Jo, & A. Hussain (编辑), Advanced Intelligent Computing Technology and Applications - 19th International Conference, ICIC 2023, Proceedings (页码 209-220). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 14088 LNCS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-99-4749-2_18