Application of Attention Mechanism-Based Dual-Modality SSD in RGB-D Hand Detection

Xiangjie Zhu, Baokui Li, Qing Fei, Qiang Wang, Haolin Jia

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Multimodal gesture recognition is a crucial research area in human-computer interaction. This paper proposes a static gesture multimodal recognition technology based on the Single Shot MultiBox Detector (SSD). Firstly, RGB image data and Depth image data are input into the VGG network to extract features. Then, trained features are concatenated in the fusion process, and the weights of features are adaptively learned with attention mechanisms. Results show that combining the two modalities improves model accuracy compared to using RGB images and Depth images separately. Next, the VGG network is replaced with the MobileNet v1 network as the backbone to make the model faster. The proposed method is tested on the Hand Gesture Dataset. The results indicate that the proposed method is superior to the single-modal gesture recognition SSD network.

源语言英语
主期刊名2023 42nd Chinese Control Conference, CCC 2023
出版商IEEE Computer Society
7811-7816
页数6
ISBN(电子版)9789887581543
DOI
出版状态已出版 - 2023
活动42nd Chinese Control Conference, CCC 2023 - Tianjin, 中国
期限: 24 7月 202326 7月 2023

出版系列

姓名Chinese Control Conference, CCC
2023-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议42nd Chinese Control Conference, CCC 2023
国家/地区中国
Tianjin
时期24/07/2326/07/23

指纹

探究 'Application of Attention Mechanism-Based Dual-Modality SSD in RGB-D Hand Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此