A bag-of-audio-words approach for snore sounds' excitation localisation

Maximilian Schmitt, Christoph Janott, Vedhas Pandit, Kun Qian, Clemens Heiser, Werner Hemmert, Björn Schuller

科研成果: 书/报告/会议事项章节会议稿件同行评审

29 引用 (Scopus)

摘要

Habitual snoring and Obstructive Sleep Apnea are serious conditions that can affect the health of the snorer. For a targeted surgical treatment, it is crucial to identify the exact location of the vibration within the upper airways. As opposed to earlier work, we present the first unsupervised feature learning approach to this task based on bags-of-audio-words. Likewise, we cluster feature values within a given time-segment into acoustic 'words'. The frequency of occurrence per such word is then represented in a histogram per sound chunk to classify between four excitation locations. In extensive test runs based on snore sound data of 24 patients labelled by experts, we evaluated several feature sets as basis for audio word creation. In the result, we find audio words based on wavelet features, formants, and MFCC to be highly suited and outperform previous experiments based on the same data set.

源语言英语
主期刊名Speech Communication - 12. ITG-Fachtagung Sprachkommunikation
出版商VDE VERLAG GMBH
230-234
页数5
ISBN(电子版)9783800742752
出版状态已出版 - 2016
已对外发布
活动12. ITG-Fachtagung Sprachkommunikation - 12th ITG Conference on Speech Communication - Paderborn, 德国
期限: 5 10月 20167 10月 2016

出版系列

姓名Speech Communication - 12. ITG-Fachtagung Sprachkommunikation

会议

会议12. ITG-Fachtagung Sprachkommunikation - 12th ITG Conference on Speech Communication
国家/地区德国
Paderborn
时期5/10/167/10/16

指纹

探究 'A bag-of-audio-words approach for snore sounds' excitation localisation' 的科研主题。它们共同构成独一无二的指纹。

引用此