Are You Speaking with a Mask? An Investigation on Attention Based Deep Temporal Convolutional Neural Networks for Mask Detection Task

Yu Qiao, Kun Qian, Ziping Zhao*, Xiaojing Zhao

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

When writing this article, COVID-19 as a global epidemic, has affected more than 200 countries and territories globally and lead to more than 694,000 deaths. Wearing a mask is one of most convenient, cheap, and efficient precautions. Moreover, guaranteeing a quality of the speech under the condition of wearing a mask is crucial in real-world telecommunication technologies. To this line, the goal of the ComParE 2020 Mask condition recognition of speakers subchallenge is to recognize the states of speakers with or without facial masks worn. In this work, we present three modeling methods under the deep neural network framework, namely Convolutional Recurrent Neural Network(CRNN), Convolutional Temporal Convolutional Network(CTCNs) and CTCNs combined with utterance level features, respectively. Furthermore, we use cycle mode to fill the samples to further enhance the system performance. In the CTCNs model, we tried different network depths. Finally, the experimental results demonstrate the effectiveness of the CTCNs network structure, which can reach an unweighted average recall (UAR) at 66.4% on the development set. This is higher than the result of baseline, which is 64.4% in S2SAE+SVM nerwork(a significance level at p< 0.001 by one-tailed z-test). It demonstrates the good performance of our proposed network.

源语言英语
主期刊名Proceedings of the 8th Conference on Sound and Music Technology - Selected Papers from CSMT
编辑Xi Shao, Kun Qian, Li Zhou, Xin Wang, Ziping Zhao
出版商Springer Science and Business Media Deutschland GmbH
163-174
页数12
ISBN(印刷版)9789811616488
DOI
出版状态已出版 - 2021
已对外发布
活动8th Conference on Sound and Music Technology, CSMT 2020 - Taiyuan, 中国
期限: 5 11月 20208 11月 2020

出版系列

姓名Lecture Notes in Electrical Engineering
761 LNEE
ISSN(印刷版)1876-1100
ISSN(电子版)1876-1119

会议

会议8th Conference on Sound and Music Technology, CSMT 2020
国家/地区中国
Taiyuan
时期5/11/208/11/20

指纹

探究 'Are You Speaking with a Mask? An Investigation on Attention Based Deep Temporal Convolutional Neural Networks for Mask Detection Task' 的科研主题。它们共同构成独一无二的指纹。

引用此