TY - JOUR
T1 - A Bag of Wavelet Features for Snore Sound Classification
AU - Qian, Kun
AU - Schmitt, Maximilian
AU - Janott, Christoph
AU - Zhang, Zixing
AU - Heiser, Clemens
AU - Hohenhorst, Winfried
AU - Herzog, Michael
AU - Hemmert, Werner
AU - Schuller, Björn
N1 - Publisher Copyright:
© 2019, Biomedical Engineering Society.
PY - 2019/4/15
Y1 - 2019/4/15
N2 - Snore sound (SnS) classification can support a targeted surgical approach to sleep related breathing disorders. Using machine listening methods, we aim to find the location of obstruction and vibration within a subject’s upper airway. Wavelet features have been demonstrated to be efficient in the recognition of SnSs in previous studies. In this work, we use a bag-of-audio-words approach to enhance the low-level wavelet features extracted from SnS data. A Naïve Bayes model was selected as the classifier based on its superiority in initial experiments. We use SnS data collected from 219 independent subjects under drug-induced sleep endoscopy performed at three medical centres. The unweighted average recall achieved by our proposed method is 69.4%, which significantly (p< 0.005 , one-tailed z-test) outperforms the official baseline (58.5%), and beats the winner (64.2%) of the INTERSPEECH ComParE Challenge 2017 Snoring sub-challenge. In addition, the conventionally used features like formants, mel-scale frequency cepstral coefficients, subband energy ratios, spectral frequency features, and the features extracted by the openSMILE toolkit are compared with our proposed feature set. The experimental results demonstrate the effectiveness of the proposed method in SnS classification.
AB - Snore sound (SnS) classification can support a targeted surgical approach to sleep related breathing disorders. Using machine listening methods, we aim to find the location of obstruction and vibration within a subject’s upper airway. Wavelet features have been demonstrated to be efficient in the recognition of SnSs in previous studies. In this work, we use a bag-of-audio-words approach to enhance the low-level wavelet features extracted from SnS data. A Naïve Bayes model was selected as the classifier based on its superiority in initial experiments. We use SnS data collected from 219 independent subjects under drug-induced sleep endoscopy performed at three medical centres. The unweighted average recall achieved by our proposed method is 69.4%, which significantly (p< 0.005 , one-tailed z-test) outperforms the official baseline (58.5%), and beats the winner (64.2%) of the INTERSPEECH ComParE Challenge 2017 Snoring sub-challenge. In addition, the conventionally used features like formants, mel-scale frequency cepstral coefficients, subband energy ratios, spectral frequency features, and the features extracted by the openSMILE toolkit are compared with our proposed feature set. The experimental results demonstrate the effectiveness of the proposed method in SnS classification.
KW - Bag-of-audio-words
KW - Drug-induced sleep endoscopy
KW - Obstructive sleep apnea
KW - Snore sound
KW - Wavelets
UR - http://www.scopus.com/inward/record.url?scp=85062846956&partnerID=8YFLogxK
U2 - 10.1007/s10439-019-02217-0
DO - 10.1007/s10439-019-02217-0
M3 - Article
C2 - 30701397
AN - SCOPUS:85062846956
SN - 0090-6964
VL - 47
SP - 1000
EP - 1011
JO - Annals of Biomedical Engineering
JF - Annals of Biomedical Engineering
IS - 4
ER -