ARNets: Action Recurrent Networks for Human Action Recognition

Guangjun Zhang, Xiaobo Cai, Guangyu Gao*, Zihua Yan, Liang Shu, Zhihui Hu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recurrent neural networks (RNNs) are powerful and robust neural networks that capture time dynamics via cycles in the graph. Compared to Convolutional Neural Networks (CNNs), RNNs consider both the current and the historical inputs which leads them to the standard approach for practitioners to address machine learning tasks involving sequential data. Compared to the achievements in these sequential learning tasks, RNNs pale in presenting a promising performance on human action recognition. Do we miss something? In this paper, we will conduct a detailed investigation of the RNNs for the human action recognition task. Several alternative recurrent architectures are designed and compared. Even simply, we find that adding the supervision signal at the end of an RNN can provide us with a comparable if not better performance than the previous state-of-the-art. We further analyze the different combinations of modalities and the effect of the length of the sequence. Compared to CNN-based recognition architecture, RNNs focus more on the action itself. Going deeper, we also find that when modeling long and difficult actions, RNNs perform much better than CNNs. It means a complex video action dataset may be required.

Original languageEnglish
Title of host publicationICIGP 2023 - Proceedings of the 6th International Conference on Image and Graphics Processing
PublisherAssociation for Computing Machinery
Pages8-15
Number of pages8
ISBN (Electronic)9781450398572
DOIs
Publication statusPublished - 6 Jan 2023
Event6th International Conference on Image and Graphics Processing, ICIGP 2023 - Chongqing, China
Duration: 6 Jan 20238 Jan 2023

Publication series

NameACM International Conference Proceeding Series

Conference

Conference6th International Conference on Image and Graphics Processing, ICIGP 2023
Country/TerritoryChina
CityChongqing
Period6/01/238/01/23

Keywords

  • action recognition
  • recurrent networks
  • visualization

Fingerprint

Dive into the research topics of 'ARNets: Action Recurrent Networks for Human Action Recognition'. Together they form a unique fingerprint.

Cite this