
Online spatio-temporal action detection with adaptive sampling and hierarchical modulation

  • Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Online spatio-temporal action detection (OSTAD) is a crucial task in video understanding, responsible for identifying and categorizing action instances in video streams in an online manner. This paper presents a novel approach that employs adaptive sampling and hierarchical modulation to enhance OSTAD capabilities. Traditional methods, often constrained by fixed sampling rates, may lead to redundancy in scenarios with slower action speeds and overlook essential details in faster-moving sequences. Our innovative dynamic sampling strategy, informed by speed estimation, adaptively adjusts sampling intervals based on speed attention and visual differential features, thereby optimizing the informational content of each sampled video clip. Additionally, our method incorporates a hierarchical modulation mechanism that synergizes high-level semantic and low-level spatial information, significantly enhancing action localization and classification accuracy. The adaptive sampling network with hierarchical modulation, underpinned by these advancements, demonstrates substantial improvements on benchmark datasets such as JHMDB21 and UCF24, proving our methods’ efficacy in handling diverse and dynamic action sequences in an online setting.
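
The idea of adjusting the sampling interval to the estimated motion speed can be illustrated with a toy sketch. This is not the paper's network: the abstract describes speed attention and visual differential features, whereas the sketch below substitutes a plain mean absolute frame difference as a hand-crafted speed proxy. The function names, the interval bounds, and the `speed_scale` constant are all hypothetical and chosen only for illustration.

```python
# Toy illustration only: a hand-crafted stand-in for speed-informed sampling.
# All names and constants here are assumptions, not taken from the paper.

def frame_difference(prev, curr):
    """Mean absolute pixel difference between two equally sized frames.

    Frames are flat lists of grayscale values; the result serves as a
    crude proxy for motion speed.
    """
    return sum(abs(a - b) for a, b in zip(prev, curr)) / len(curr)


def sampling_interval(speed, min_interval=1, max_interval=8, speed_scale=10.0):
    """Map a speed proxy to a frame interval: faster motion, denser sampling."""
    ratio = min(speed / speed_scale, 1.0)  # clamp the normalized speed to [0, 1]
    interval = max_interval - ratio * (max_interval - min_interval)
    return max(min_interval, round(interval))


def adaptive_sample(frames, **kwargs):
    """Return the indices of sampled frames, using only already-seen frames.

    The first step always uses the minimum interval because no motion
    estimate is available yet; later steps use the difference between
    the current frame and the one immediately before it.
    """
    if not frames:
        return []
    indices = [0]
    i = 0
    while True:
        if i == 0:
            step = kwargs.get("min_interval", 1)
        else:
            speed = frame_difference(frames[i - 1], frames[i])
            step = sampling_interval(speed, **kwargs)
        i += step
        if i >= len(frames):
            break
        indices.append(i)
    return indices
```

With a static clip the sketch quickly falls back to the largest interval, while a clip whose pixel values change sharply between frames is sampled at every frame. A learned speed estimator, as the abstract describes, would replace `frame_difference` in a real system.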

Original language: English
Article number: 349
Journal: Multimedia Systems
Volume: 30
Issue number: 6
DOI
Publication status: Published - Dec 2024
