Hierarchical Motion Excitation Network for Few-Shot Video Recognition

Bing Wang, Xiaohua Wang, Shiwei Ren, Weijiang Wang, Yueting Shi*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Most of the existing deep learning algorithms are supervised learning and rely on a tremendous number of manually labeled samples. However, in most domains, due to the scarcity of samples or the excessive cost of labeling, it would be impracticable to provide numerous labeled training samples to the network. In this paper, a few-shot video classification network termed Hierarchical Motion Excitation Network (HME-Net) is proposed from the perspective of accumulated feature-level motion information. An HME module composed of Motion Excitation (ME) and Interval Frame Motion Excitation (IFME) is designed to extract feature-level motion patterns from adjacent frames and interval frames. The HME module can discover and enhance the feature-level motion-sensitive information in the original features. The accumulative time window is expanded to four frames in a hierarchical manner, which achieves the purpose of increasing the receptive field. After extensive experimentation, HME-Net is demonstrated to be able to consistently outperform the existing few-shot video classification models. On the UCF101 and HMDB51 datasets, our method is established as a new state-of-the-art technique for the few-shot settings of five-way three-shot and five-way five-shot video recognition.

Original languageEnglish
Article number1090
JournalElectronics (Switzerland)
Volume12
Issue number5
DOIs
Publication statusPublished - Mar 2023

Keywords

  • few-shot learning
  • meta-learning
  • motion information
  • video recognition

Fingerprint

Dive into the research topics of 'Hierarchical Motion Excitation Network for Few-Shot Video Recognition'. Together they form a unique fingerprint.

Cite this