Proposal Semantic Relationship Graph Network for Temporal Action Detection

Shaowen Su*, Yan Zhang, Minggang Gan

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Temporal action detection, a critical task in video activity understanding, is typically divided into two stages: proposal generation and classification. However, most existing methods overlook the importance of information transfer among proposals during classification, often treating each proposal in isolation, which hampers accurate label prediction. In this article, we propose a novel method for inferring semantic relationships both within and between action proposals, guiding the fusion of action proposal features accordingly. Building on this approach, we introduce the Proposal Semantic Relationship Graph Network (PSRGN), an end-to-end model that leverages intra-proposal semantic relationship graphs to extract cross-scale temporal context and an inter-proposal semantic relationship graph to incorporate complementary neighboring information, significantly improving proposal feature quality and overall detection performance. This is the first method to apply graph structure learning in temporal action detection, adaptively constructing the inter-proposal semantic graph. Extensive experiments on two datasets demonstrate the effectiveness of our approach, achieving state-of-the-art (SOTA). Code and results are available at http://github.com/Riiick2011/PSRGN.

Original languageEnglish
Article numberART135
JournalACM Transactions on Intelligent Systems and Technology
Volume15
Issue number6
DOIs
Publication statusPublished - 13 Dec 2024

Keywords

  • graph convolutional network
  • graph structure learning
  • proposal semantic relationship graph
  • Temporal action detection

Fingerprint

Dive into the research topics of 'Proposal Semantic Relationship Graph Network for Temporal Action Detection'. Together they form a unique fingerprint.

Cite this