An Investigation on Data Augmentation and Multiple Instance Learning for Diagnosis of COVID-19 from Speech and Cough Sound

Tomoya Koike, Zhihua Wang, Kun Qian*, Bin Hu*, Björn W. Schuller, Yoshiharu Yamamoto

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Computer audition based approaches for diagnosing COVID-19 can provide a low-cost, convenient, and real-time solution for combating the ongoing global pandemic. In this contribution, we present an investigation on data augmentation and multiple instance learning methods for diagnosis of COVID-19 from speech and cough sound data. We firstly introduce a novel deep convolutional neural network pre-trained on large scale audio data set, i. e., AudioSet. Moreover, we use a multiple instance learning paradigm to address the training difficulties caused by the varied length of the audio instances. Experimental results demonstrate the efficiency of the proposed methods, which can reach a best performance at 75.9 % of the unweighted average recall, surpassing the official baseline single best by 3.0 % and baseline fusion best by 2.0 %.

Original languageEnglish
Title of host publication2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages783-784
Number of pages2
ISBN (Electronic)9798350324174
DOIs
Publication statusPublished - 2023
Event2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Pingtung, Taiwan, Province of China
Duration: 17 Jul 202319 Jul 2023

Publication series

Name2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023 - Proceedings

Conference

Conference2023 International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2023
Country/TerritoryTaiwan, Province of China
CityPingtung
Period17/07/2319/07/23

Fingerprint

Dive into the research topics of 'An Investigation on Data Augmentation and Multiple Instance Learning for Diagnosis of COVID-19 from Speech and Cough Sound'. Together they form a unique fingerprint.

Cite this