跳到主要导航 跳到搜索 跳到主要内容

Proposal Semantic Relationship Graph Network for Temporal Action Detection

  • Shaowen Su*
  • , Yan Zhang
  • , Minggang Gan
  • *此作品的通讯作者
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

Temporal action detection, a critical task in video activity understanding, is typically divided into two stages: proposal generation and classification. However, most existing methods overlook the importance of information transfer among proposals during classification, often treating each proposal in isolation, which hampers accurate label prediction. In this article, we propose a novel method for inferring semantic relationships both within and between action proposals, guiding the fusion of action proposal features accordingly. Building on this approach, we introduce the Proposal Semantic Relationship Graph Network (PSRGN), an end-to-end model that leverages intra-proposal semantic relationship graphs to extract cross-scale temporal context and an inter-proposal semantic relationship graph to incorporate complementary neighboring information, significantly improving proposal feature quality and overall detection performance. This is the first method to apply graph structure learning in temporal action detection, adaptively constructing the inter-proposal semantic graph. Extensive experiments on two datasets demonstrate the effectiveness of our approach, achieving state-of-the-art (SOTA). Code and results are available at http://github.com/Riiick2011/PSRGN.

源语言英语
文章编号ART135
期刊ACM Transactions on Intelligent Systems and Technology
15
6
DOI
出版状态已出版 - 13 12月 2024

指纹

探究 'Proposal Semantic Relationship Graph Network for Temporal Action Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此