Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data

Zixing Zhang; Jing Han; Kun Qian; Christoph Janott; Yanan Guo; Bjorn Schuller

doi:10.1109/JBHI.2019.2907286

Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data

Zixing Zhang^*, Jing Han, Kun Qian, Christoph Janott, Yanan Guo, Bjorn Schuller

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

42 引用（Scopus）

摘要

One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing 'realistic' high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.

源语言	英语
文章编号	8678828
页（从-至）	300-310
页数	11
期刊	IEEE Journal of Biomedical and Health Informatics
卷	24
期	1
DOI	https://doi.org/10.1109/JBHI.2019.2907286
出版状态	已出版 - 1月 2020
已对外发布	是

访问文件

10.1109/JBHI.2019.2907286

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, Z., Han, J., Qian, K., Janott, C., Guo, Y., & Schuller, B. (2020). Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data. IEEE Journal of Biomedical and Health Informatics, 24(1), 300-310. 文章 8678828. https://doi.org/10.1109/JBHI.2019.2907286

@article{48a49990b9ea4c5cb96a1a8790a8df07,

title = "Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data",

abstract = "One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing 'realistic' high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.",

keywords = "Snore sound classification, data augmentation, data synthesis, obstructive sleep apnea",

author = "Zixing Zhang and Jing Han and Kun Qian and Christoph Janott and Yanan Guo and Bjorn Schuller",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2020",

month = jan,

doi = "10.1109/JBHI.2019.2907286",

language = "English",

volume = "24",

pages = "300--310",

journal = "IEEE Journal of Biomedical and Health Informatics",

issn = "2168-2194",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "1",

}

TY - JOUR

T1 - Snore-GANs

T2 - Improving Automatic Snore Sound Classification with Synthesized Data

AU - Zhang, Zixing

AU - Han, Jing

AU - Qian, Kun

AU - Janott, Christoph

AU - Guo, Yanan

AU - Schuller, Bjorn

PY - 2020/1

Y1 - 2020/1

N2 - One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing 'realistic' high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.

AB - One of the frontier issues that severely hamper the development of automatic snore sound classification (ASSC) associates to the lack of sufficient supervised training data. To cope with this problem, we propose a novel data augmentation approach based on semi-supervised conditional generative adversarial networks (scGANs), which aims to automatically learn a mapping strategy from a random noise space to original data distribution. The proposed approach has the capability of well synthesizing 'realistic' high-dimensional data, while requiring no additional annotation process. To handle the mode collapse problem of GANs, we further introduce an ensemble strategy to enhance the diversity of the generated data. The systematic experiments conducted on a widely used Munich-Passau snore sound corpus demonstrate that the scGANs-based systems can remarkably outperform other classic data augmentation systems, and are also competitive to other recently reported systems for ASSC.

KW - Snore sound classification

KW - data augmentation

KW - data synthesis

KW - obstructive sleep apnea

UR - http://www.scopus.com/inward/record.url?scp=85077666833&partnerID=8YFLogxK

U2 - 10.1109/JBHI.2019.2907286

DO - 10.1109/JBHI.2019.2907286

M3 - Article

C2 - 30946682

AN - SCOPUS:85077666833

SN - 2168-2194

VL - 24

SP - 300

EP - 310

JO - IEEE Journal of Biomedical and Health Informatics

JF - IEEE Journal of Biomedical and Health Informatics

IS - 1

M1 - 8678828

ER -

Snore-GANs: Improving Automatic Snore Sound Classification with Synthesized Data

摘要

访问文件

其它文件与链接

指纹

引用此