A Lightweight Multi-Level Relation Network for Few-shot Action Recognition

Enqi Liu, Liyuan Pan*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Few-shot (FS) action recognition classifies new actions with limited training samples. Most existing works focus on the variability between actions/videos by designing for either feature extraction methods or training strategies. However, they ignore the relations for a same action at different time clips, which is crucial to improve class-specific discriminability. In this paper, we propose a lightweight multi-level relation network (MLRN) that considers the variability of an action that inner- and cross-video, based on episodic training strategies. Furthermore, a query-support similarity classifier is introduced to improve the class identifiability by enhancing the feature utilisation at different levels. Experiments on three challenging benchmarks demonstrate that the proposed MLRN outperforms state-of-the-art methods while using approximately 50% fewer trainable parameters.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Multimedia and Expo, ICME 2024
PublisherIEEE Computer Society
ISBN (Electronic)9798350390155
DOIs
Publication statusPublished - 2024
Event2024 IEEE International Conference on Multimedia and Expo, ICME 2024 - Niagra Falls, Canada
Duration: 15 Jul 202419 Jul 2024

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2024 IEEE International Conference on Multimedia and Expo, ICME 2024
Country/TerritoryCanada
CityNiagra Falls
Period15/07/2419/07/24

Keywords

  • Few-shot action recognition
  • lightweight
  • multi-level relation
  • similarity classifier

Fingerprint

Dive into the research topics of 'A Lightweight Multi-Level Relation Network for Few-shot Action Recognition'. Together they form a unique fingerprint.

Cite this