An Action Recognition Algorithm Based on Two-Stream Deep Learning for Metaverse Applications

Jiayue Liu, Tianqi Mao*, Yicheng Huang, Dongxuan He*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Action recognition algorithms have gained significant attention in recent years, which can be indispensable for a plethora of cutting-edge applications like extended reality or Metaverse. These services often pose stringent requirement on immediate sensing and cognition of the surroundings, which necessitates immediate classifications of the captured actions (e.g., video data) that classical signal processing methods can hardly attain. In this paper, we introduced a residual artificial neural network with two-stream structure to further improve the accuracy of action recognition algorithm. Specifically, two residual networks (ResNet101) are trained separately, one by spatial RGB image streams, and another by optical flow streams. The two-strem network outputs are then fed into a fusion classifier, in which information extracted by spatial network and temporal network jointly determines the classification result. Moreover, in the training process, hyper-parameters setting and optimizer selection are performed numerically to achieve optimal performance. Finally, the recognition accuracy of the proposed algorithm has been compared to other existing widely-employed counterparts, where UCF101 data set is utilized for training and testing. Simulations validates aiming that the network can achieve higher recognition accuracy than traditional algorithms, and the two-stream method shows its superiority over the single-network counterpart.

Original languageEnglish
Title of host publication20th International Wireless Communications and Mobile Computing Conference, IWCMC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages639-642
Number of pages4
ISBN (Electronic)9798350361261
DOIs
Publication statusPublished - 2024
Event20th IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2024 - Hybrid, Ayia Napa, Cyprus
Duration: 27 May 202431 May 2024

Publication series

Name20th International Wireless Communications and Mobile Computing Conference, IWCMC 2024

Conference

Conference20th IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2024
Country/TerritoryCyprus
CityHybrid, Ayia Napa
Period27/05/2431/05/24

Keywords

  • Action recognition
  • deep learning
  • residual network
  • ResNet101
  • two-stream method

Fingerprint

Dive into the research topics of 'An Action Recognition Algorithm Based on Two-Stream Deep Learning for Metaverse Applications'. Together they form a unique fingerprint.

Cite this