Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data

Zixing Zhang*, Jing Han, Kun Qian, Christoph Janott, Yanan Guo, Bjorn Schuller

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

42 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 42
  • Captures
    • Readers: 93
see details

摘要

One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing 'realistic' high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.

源语言英语
文章编号8678828
页(从-至)300-310
页数11
期刊IEEE Journal of Biomedical and Health Informatics
24
1
DOI
出版状态已出版 - 1月 2020
已对外发布

指纹

探究 'Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhang, Z., Han, J., Qian, K., Janott, C., Guo, Y., & Schuller, B. (2020). Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data. IEEE Journal of Biomedical and Health Informatics, 24(1), 300-310. 文章 8678828. https://doi.org/10.1109/JBHI.2019.2907286