
Soft-Median Selection: An adaptive feature smoothening method for sound event detection

  • Fengnian Zhao
  • Ruwei Li*
  • Xin Liu
  • Liwen Xu
  • *Corresponding author for this work
  • Beijing University of Technology
  • Huawei Technologies Co., Ltd.

Research output: Contribution to journal › Article › peer-review

Abstract

Existing Sound Event Detection (SED) algorithms focus on the differences between frames inside an event but pay too little attention to event boundaries. This leads to event splitting, false negatives, and inaccurate start and end times, all of which reduce SED performance. To address this problem, this paper proposes Soft-Median Selection (SMS), which adaptively smoothens frame features along the time axis. First, a Differentiable Soft-Median Filter (DSMF) is designed so that the filter can be used inside a neural network. Second, several DSMFs are combined with a Linear Selection to form the SMS: DSMFs of different lengths smoothen the features to different degrees, and the Linear Selection adaptively combines the smoothened features. Because the weight of each DSMF is learned, SMS smoothens features adaptively without parameters being set in advance and therefore generalizes well. The proposed DSMF addresses the problem that the gradient cannot propagate across a conventional median filter, or does so only non-smoothly. Experimental results show that the SMS-based SED algorithm improves the accuracy of event boundary (edge) detection and makes predictions within sound events more stable. Its Event-based F1 Score (EBFS) is 21.7% higher than the baseline and 3.0% higher than that of the winning algorithm in Task 4 of Detection and Classification of Acoustic Scenes and Events (DCASE) 2019.
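The abstract does not give the DSMF or Linear Selection formulas, so the following is only a minimal NumPy sketch of the general idea, not the authors' implementation. It assumes one common way to relax a median: each sample in a window is scored by its summed absolute distance to the other samples (the true median minimizes this cost), and a softmax over the negative costs yields differentiable weights. The names `soft_median_filter`, `soft_median_selection`, `temperature`, and `selection_logits` are hypothetical, and the softmax-normalized mixing of filter outputs is likewise an assumption standing in for the learned Linear Selection.

```python
import numpy as np


def soft_median_filter(x, length, temperature=0.1):
    """Soft median over a sliding window along the time axis.

    x has shape (time, features); length must be odd.
    Assumed formulation (not taken from the paper): weights are a
    softmax over the negative summed absolute deviations in the window.
    """
    if length % 2 == 0:
        raise ValueError("length must be odd")
    pad = length // 2
    # Edge-pad so the output keeps the same number of frames as the input.
    xp = np.pad(x, ((pad, pad), (0, 0)), mode="edge")
    # windows has shape (time, features, length).
    windows = np.lib.stride_tricks.sliding_window_view(xp, length, axis=0)
    # cost[..., i] = sum_j |w_i - w_j|; the hard median minimizes this cost.
    cost = np.abs(windows[..., :, None] - windows[..., None, :]).sum(axis=-1)
    logits = -cost / temperature
    logits -= logits.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=-1, keepdims=True)
    # Weighted average of the window samples; approaches the hard median
    # as temperature goes to zero.
    return (weights * windows).sum(axis=-1)


def soft_median_selection(x, lengths, selection_logits, temperature=0.1):
    """Mix soft-median filters of several lengths.

    selection_logits stands in for weights that a network would learn;
    here they are softmax-normalized (an assumption) and passed in by hand.
    """
    filtered = np.stack(
        [soft_median_filter(x, n, temperature) for n in lengths], axis=0
    )
    logits = np.asarray(selection_logits, dtype=float)
    mix = np.exp(logits - logits.max())
    mix /= mix.sum()
    # Weighted sum over the filter axis, result has the shape of x.
    return np.tensordot(mix, filtered, axes=1)


# Toy frame-level feature track: a one-frame spike followed by an event onset.
track = np.array([0.0, 0.0, 5.0, 0.0, 0.0, 1.0, 1.0, 1.0])[:, None]
smoothed = soft_median_selection(track, lengths=(1, 3, 5),
                                 selection_logits=(0.0, 1.0, 1.0))
```

In a trainable model the same computation would be written in an automatic-differentiation framework so that both the filter outputs and the selection weights receive gradients; NumPy is used here only to keep the sketch self-contained.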

Original language: English
Article number: 108715
Journal: Applied Acoustics
Volume: 192
Publication status: Published - Apr 2022
Externally published
