Denoised Temporal Relation Network for Temporal Action Segmentation

Zhichao Ma, Kan Li*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Temporal relations among action segments play a crucial role in temporal action segmentation. Existing methods tend to employ the graph neural network to model the temporal relation. However, the performance is unsatisfactory and exhibits serious over-segmentation due to the generated noisy features. To solve the above issues, we present an action segmentation framework, termed a denoised temporal relation network (DTRN). In DTRN, a temporal reasoning module (TRM) models inter-segment temporal relations and conducts feature denoising jointly. Specifically, the TRM conducts an uncertainty-gated reasoning mechanism for noise-immune and utilizes a cross-attention-based structure to combine the informative clues from the discriminative enhance module which is trained under Selective Margin Plasticity (SMP) to ensure informative clues, SMP adjusts the decision boundary adaptively by changing specific margins in real-time. Our framework is demonstrated to be effective and achieves state-of-the-art performance of accuracy, edit score, and F1 score on the challenging 50Salads, GTEA, and Breakfast benchmarks.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Proceedings
EditorsQingshan Liu, Hanzi Wang, Rongrong Ji, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages282-294
Number of pages13
ISBN (Print)9789819985364
DOIs
Publication statusPublished - 2024
Event6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023 - Xiamen, China
Duration: 13 Oct 202315 Oct 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14430 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023
Country/TerritoryChina
CityXiamen
Period13/10/2315/10/23

Keywords

  • Denoised Temporal Relation Network
  • Selective Margin Plasticity
  • Temporal Action Segmentation

Fingerprint

Dive into the research topics of 'Denoised Temporal Relation Network for Temporal Action Segmentation'. Together they form a unique fingerprint.

Cite this