ARNets: Action Recurrent Networks for Human Action Recognition

Guangjun Zhang, Xiaobo Cai, Guangyu Gao*, Zihua Yan, Liang Shu, Zhihui Hu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Recurrent neural networks (RNNs) are powerful and robust neural networks that capture time dynamics via cycles in the graph. Compared to Convolutional Neural Networks (CNNs), RNNs consider both the current and the historical inputs which leads them to the standard approach for practitioners to address machine learning tasks involving sequential data. Compared to the achievements in these sequential learning tasks, RNNs pale in presenting a promising performance on human action recognition. Do we miss something? In this paper, we will conduct a detailed investigation of the RNNs for the human action recognition task. Several alternative recurrent architectures are designed and compared. Even simply, we find that adding the supervision signal at the end of an RNN can provide us with a comparable if not better performance than the previous state-of-the-art. We further analyze the different combinations of modalities and the effect of the length of the sequence. Compared to CNN-based recognition architecture, RNNs focus more on the action itself. Going deeper, we also find that when modeling long and difficult actions, RNNs perform much better than CNNs. It means a complex video action dataset may be required.

源语言英语
主期刊名ICIGP 2023 - Proceedings of the 6th International Conference on Image and Graphics Processing
出版商Association for Computing Machinery
8-15
页数8
ISBN(电子版)9781450398572
DOI
出版状态已出版 - 6 1月 2023
活动6th International Conference on Image and Graphics Processing, ICIGP 2023 - Chongqing, 中国
期限: 6 1月 20238 1月 2023

出版系列

姓名ACM International Conference Proceeding Series

会议

会议6th International Conference on Image and Graphics Processing, ICIGP 2023
国家/地区中国
Chongqing
时期6/01/238/01/23

指纹

探究 'ARNets: Action Recurrent Networks for Human Action Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此