An Investigation on Data Augmentation and Multiple Instance Learning for Diagnosis of COVID-19 from Speech and Cough Sound

Tomoya Koike, Zhihua Wang, Kun Qian*, Bin Hu*, Björn W. Schuller, Yoshiharu Yamamoto

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Computer audition based approaches for diagnosing COVID-19 can provide a low-cost, convenient, and real-time solution for combating the ongoing global pandemic. In this contribution, we present an investigation on data augmentation and multiple instance learning methods for diagnosis of COVID-19 from speech and cough sound data. We firstly introduce a novel deep convolutional neural network pre-trained on large scale audio data set, i. e., AudioSet. Moreover, we use a multiple instance learning paradigm to address the training difficulties caused by the varied length of the audio instances. Experimental results demonstrate the efficiency of the proposed methods, which can reach a best performance at 75.9 % of the unweighted average recall, surpassing the official baseline single best by 3.0 % and baseline fusion best by 2.0 %.

源语言英语
主期刊名2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
783-784
页数2
ISBN(电子版)9798350324174
DOI
出版状态已出版 - 2023
活动2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Pingtung, 中国台湾
期限: 17 7月 202319 7月 2023

出版系列

姓名2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings

会议

会议2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023
国家/地区中国台湾
Pingtung
时期17/07/2319/07/23

指纹

探究 'An Investigation on Data Augmentation and Multiple Instance Learning for Diagnosis of COVID-19 from Speech and Cough Sound' 的科研主题。它们共同构成独一无二的指纹。

引用此