A GLANCE-AND-GAZE NETWORK FOR RESPIRATORY SOUND CLASSIFICATION

Shuai Yu, Yiwei Ding, Kun Qian*, Bin Hu*, Wei Li, Björn W. Schuller

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 7
  • Captures
    • Readers: 12
see details

Abstract

A plethora of great successes has been achieved by the existing convolutional neural networks (CNN) for respiratory sound classification. Nevertheless, simultaneously capturing both the local and global features can never be an easy task due to the limitation of a CNN's structure. In this contribution, we propose a novel glance-and-gaze network to address the aforementioned issue. The glance block aims to learn global information, while the gaze block is responsible for learning local patterns and suppressing the noises that attenuates the final performance. In the proposed method, both the global and local information can be extracted. Moreover, the spectral and temporal representations can be learnt via a feature fusion module. Experimental results on the largest public respiratory sound database demonstrate that the proposed model outperforms the state-of-the-art methods.

Original languageEnglish
Title of host publication2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages9007-9011
Number of pages5
ISBN (Electronic)9781665405409
DOIs
Publication statusPublished - 2022
Event47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Virtual, Online, Singapore
Duration: 23 May 202227 May 2022

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume2022-May
ISSN (Print)1520-6149

Conference

Conference47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022
Country/TerritorySingapore
CityVirtual, Online
Period23/05/2227/05/22

Keywords

  • Computer Audition
  • Digital Health
  • Feature Fusion
  • Glance-and-Gaze Network
  • Respiratory Sound Classification

Fingerprint

Dive into the research topics of 'A GLANCE-AND-GAZE NETWORK FOR RESPIRATORY SOUND CLASSIFICATION'. Together they form a unique fingerprint.

Cite this

Yu, S., Ding, Y., Qian, K., Hu, B., Li, W., & Schuller, B. W. (2022). A GLANCE-AND-GAZE NETWORK FOR RESPIRATORY SOUND CLASSIFICATION. In 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings (pp. 9007-9011). (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 2022-May). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP43922.2022.9746053