TY - JOUR
T1 - Snoring classified
T2 - The Munich-Passau Snore Sound Corpus
AU - Janott, Christoph
AU - Schmitt, Maximilian
AU - Zhang, Yue
AU - Qian, Kun
AU - Pandit, Vedhas
AU - Zhang, Zixing
AU - Heiser, Clemens
AU - Hohenhorst, Winfried
AU - Herzog, Michael
AU - Hemmert, Werner
AU - Schuller, Björn
N1 - Publisher Copyright:
© 2018 Elsevier Ltd
PY - 2018/3/1
Y1 - 2018/3/1
N2 - Objective: Snoring can be excited in different locations within the upper airways during sleep. It was hypothesised that the excitation locations are correlated with distinct acoustic characteristics of the snoring noise. To verify this hypothesis, a database of snore sounds is developed, labelled with the location of sound excitation. Methods: Video and audio recordings taken during drug induced sleep endoscopy (DISE) examinations from three medical centres have been semi-automatically screened for snore events, which subsequently have been classified by ENT experts into four classes based on the VOTE classification. The resulting dataset containing 828 snore events from 219 subjects has been split into Train, Development, and Test sets. An SVM classifier has been trained using low level descriptors (LLDs) related to energy, spectral features, mel frequency cepstral coefficients (MFCC), formants, voicing, harmonic-to-noise ratio (HNR), spectral harmonicity, pitch, and microprosodic features. Results: An unweighted average recall (UAR) of 55.8% could be achieved using the full set of LLDs including formants. Best performing subset is the MFCC-related set of LLDs. A strong difference in performance could be observed between the permutations of train, development, and test partition, which may be caused by the relatively low number of subjects included in the smaller classes of the strongly unbalanced data set. Conclusion: A database of snoring sounds is presented which are classified according to their sound excitation location based on objective criteria and verifiable video material. With the database, it could be demonstrated that machine classifiers can distinguish different excitation location of snoring sounds in the upper airway based on acoustic parameters.
AB - Objective: Snoring can be excited in different locations within the upper airways during sleep. It was hypothesised that the excitation locations are correlated with distinct acoustic characteristics of the snoring noise. To verify this hypothesis, a database of snore sounds is developed, labelled with the location of sound excitation. Methods: Video and audio recordings taken during drug induced sleep endoscopy (DISE) examinations from three medical centres have been semi-automatically screened for snore events, which subsequently have been classified by ENT experts into four classes based on the VOTE classification. The resulting dataset containing 828 snore events from 219 subjects has been split into Train, Development, and Test sets. An SVM classifier has been trained using low level descriptors (LLDs) related to energy, spectral features, mel frequency cepstral coefficients (MFCC), formants, voicing, harmonic-to-noise ratio (HNR), spectral harmonicity, pitch, and microprosodic features. Results: An unweighted average recall (UAR) of 55.8% could be achieved using the full set of LLDs including formants. Best performing subset is the MFCC-related set of LLDs. A strong difference in performance could be observed between the permutations of train, development, and test partition, which may be caused by the relatively low number of subjects included in the smaller classes of the strongly unbalanced data set. Conclusion: A database of snoring sounds is presented which are classified according to their sound excitation location based on objective criteria and verifiable video material. With the database, it could be demonstrated that machine classifiers can distinguish different excitation location of snoring sounds in the upper airway based on acoustic parameters.
KW - Drug-Induced Sleep Endoscopy
KW - Machine learning
KW - Obstructive Sleep Apnea
KW - Primary snoring
KW - Snore sound classification
UR - http://www.scopus.com/inward/record.url?scp=85041425160&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2018.01.007
DO - 10.1016/j.compbiomed.2018.01.007
M3 - Article
C2 - 29407995
AN - SCOPUS:85041425160
SN - 0010-4825
VL - 94
SP - 106
EP - 118
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
ER -