Teaching machines on snoring: A benchmark on computer audition for snore sound excitation localisation

Kun Qian; Christoph Janott; Zixing Zhang; Jun Deng; Alice Baird; Clemens Heiser; Winfried Hohenhorst; Michael Herzog; Werner Hemmert; Björn Schuller

doi:10.24425/123918

Kun Qian^*, Christoph Janott, Zixing Zhang, Jun Deng, Alice Baird, Clemens Heiser, Winfried Hohenhorst, Michael Herzog, Werner Hemmert, Björn Schuller

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

This paper presents a comprehensive study on machine listening for localisation of snore sound excitation. We investigate the effects of varied frame sizes and overlaps of the analysed audio chunk on the extraction of low-level descriptors. In addition, we explore the performance of each kind of feature when it is fed into varied classifier models, including support vector machines, k-nearest neighbours, linear discriminant analysis, random forests, extreme learning machines, kernel-based extreme learning machines, multilayer perceptrons, and deep neural networks. Experimental results demonstrate that wavelet packet transform energy can outperform most other features. A deep neural network trained with subband energy ratios reaches the highest performance, achieving an unweighted average recall of 72.8% across four types of snoring.
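The abstract reports its headline result as an unweighted average recall (UAR) without defining it. UAR is conventionally the mean of the per-class recalls, so each class counts equally however many samples it has. The sketch below illustrates that convention only; the function name, the class labels "A" to "D", and the label sequences are hypothetical and are not taken from the paper or its data.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class counts equally regardless of size."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    total = defaultdict(int)    # samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    if not total:
        raise ValueError("no samples given")
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Made-up labels for four hypothetical snore classes (not the paper's data).
y_true = ["A", "A", "A", "A", "B", "B", "C", "C", "D", "D"]
y_pred = ["A", "A", "A", "B", "B", "B", "C", "A", "D", "D"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.4f}, accuracy = {accuracy:.4f}")
```

In this toy case the per-class recalls are 0.75, 1.0, 0.5 and 1.0, so UAR differs from plain accuracy because the classes are not equally sized. That difference is the usual reason to prefer UAR on imbalanced class distributions.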

Original language	English
Pages (from-to)	465-475
Number of pages	11
Journal	Archives of Acoustics
Volume	43
Issue number	3
DOI	https://doi.org/10.24425/123918
Publication status	Published - 2018
Externally published	Yes

Cite this

Qian, K., Janott, C., Zhang, Z., Deng, J., Baird, A., Heiser, C., Hohenhorst, W., Herzog, M., Hemmert, W., & Schuller, B. (2018). Teaching machines on snoring: A benchmark on computer audition for snore sound excitation localisation. Archives of Acoustics, 43(3), 465-475. https://doi.org/10.24425/123918

@article{48f5f690d56542f48c1942e4cffd4939,
  title = "Teaching machines on snoring: A benchmark on computer audition for snore sound excitation localisation",
  abstract = "This paper proposes a comprehensive study on machine listening for localisation of snore sound excitation. Here we investigate the effects of varied frame sizes, and overlap of the analysed audio chunk for extracting low-level descriptors. In addition, we explore the performance of each kind of feature when it is fed into varied classifier models, including support vector machines, k-nearest neighbours, linear discriminant analysis, random forests, extreme learning machines, kernel-based extreme learning machines, multilayer perceptrons, and deep neural networks. Experimental results demonstrate that, wavelet packet transform energy can outperform most other features. A deep neural network trained with subband energy ratios reaches the highest performance achieving an unweighted average recall of 72.8% from four types for snoring.",
  keywords = "Acoustic features, Machine learning, Obstructive sleep apnea, Snore sound",
  author = "Kun Qian and Christoph Janott and Zixing Zhang and Jun Deng and Alice Baird and Clemens Heiser and Winfried Hohenhorst and Michael Herzog and Werner Hemmert and Bj{\"o}rn Schuller",
  note = "Publisher Copyright: Copyright {\textcopyright} 2018 by PAN – IPPT.",
  year = "2018",
  doi = "10.24425/123918",
  language = "English",
  volume = "43",
  pages = "465--475",
  journal = "Archives of Acoustics",
  issn = "0137-5075",
  publisher = "Polish Academy of Sciences, Committee on Acoustics",
  number = "3",
}

TY - JOUR
T1 - Teaching machines on snoring
T2 - A benchmark on computer audition for snore sound excitation localisation
AU - Qian, Kun
AU - Janott, Christoph
AU - Zhang, Zixing
AU - Deng, Jun
AU - Baird, Alice
AU - Heiser, Clemens
AU - Hohenhorst, Winfried
AU - Herzog, Michael
AU - Hemmert, Werner
AU - Schuller, Björn
PY - 2018
Y1 - 2018
N2 - This paper proposes a comprehensive study on machine listening for localisation of snore sound excitation. Here we investigate the effects of varied frame sizes, and overlap of the analysed audio chunk for extracting low-level descriptors. In addition, we explore the performance of each kind of feature when it is fed into varied classifier models, including support vector machines, k-nearest neighbours, linear discriminant analysis, random forests, extreme learning machines, kernel-based extreme learning machines, multilayer perceptrons, and deep neural networks. Experimental results demonstrate that, wavelet packet transform energy can outperform most other features. A deep neural network trained with subband energy ratios reaches the highest performance achieving an unweighted average recall of 72.8% from four types for snoring.
AB - This paper proposes a comprehensive study on machine listening for localisation of snore sound excitation. Here we investigate the effects of varied frame sizes, and overlap of the analysed audio chunk for extracting low-level descriptors. In addition, we explore the performance of each kind of feature when it is fed into varied classifier models, including support vector machines, k-nearest neighbours, linear discriminant analysis, random forests, extreme learning machines, kernel-based extreme learning machines, multilayer perceptrons, and deep neural networks. Experimental results demonstrate that, wavelet packet transform energy can outperform most other features. A deep neural network trained with subband energy ratios reaches the highest performance achieving an unweighted average recall of 72.8% from four types for snoring.
KW - Acoustic features
KW - Machine learning
KW - Obstructive sleep apnea
KW - Snore sound
UR - http://www.scopus.com/inward/record.url?scp=85054034971&partnerID=8YFLogxK
U2 - 10.24425/123918
DO - 10.24425/123918
M3 - Article
AN - SCOPUS:85054034971
SN - 0137-5075
VL - 43
SP - 465
EP - 475
JO - Archives of Acoustics
JF - Archives of Acoustics
IS - 3
ER -
