Human Sound Classification based on Feature Fusion Method with Air and Bone Conducted Signal

Liang Xu, Jing Wang, Lizhong Wang, Sijun Bi, Jianqian Zhang, Qiuyue Ma

科研成果: 期刊稿件会议文章同行评审

2 引用 (Scopus)

摘要

The human sound classification task aims at distinguishing different sounds made by human, which can be widely used in medical and health detection area. Different from other sounds in acoustic scene classification task, human sounds can be transmitted either through air or bone conduction. The bone conducted (BC) signal generated by a speaker has strong anti-noise properties and can assist the air conducted (AC) signal to extract additional acoustic features. In this paper, we explore the effect of the BC signal on human sound classification task. Two stream audios combing BC and AC signals are input to a CNN-based model. An attentional feature fusion method suitable for BC and AC signal features is proposed to improve the performance according to the complementarity between the two signal features. Further improvement can be obtained by using a BC signal feature enhancement method. Experiments on an open access and a self-built dataset show that fusing bone conducted signal can achieve 6.2%/17.4% performance improvement over the baseline with only AC signal as input. The results demonstrate the application value of bone conducted signals and the superior performance of the proposed methods.

源语言英语
页(从-至)1506-1510
页数5
期刊Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
2022-September
DOI
出版状态已出版 - 2022
活动23rd Annual Conference of the International Speech Communication Association, INTERSPEECH 2022 - Incheon, 韩国
期限: 18 9月 202222 9月 2022

指纹

探究 'Human Sound Classification based on Feature Fusion Method with Air and Bone Conducted Signal' 的科研主题。它们共同构成独一无二的指纹。

引用此