Attention-Guided Multi-modal and Multi-scale Fusion for Multispectral Pedestrian Detection

Wei Bao, Meiyu Huang*, Jingjing Hu, Xueshuang Xiang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Multispectral pedestrian detection provides more accurate and reliable detection results by leveraging complementary information from color-thermal modalities and has drawn much attention in the open world. Much progress has been made in the feature-level-based detection methods which aim to effectively fuse the multispectral features extracted by the convolution neural networks. However, existing methods mainly focus on the information integration between the same-level feature maps and ignore the complementary local features scattered in multi-scale layers. In this paper, we introduce an Attention-guided multi-Modal and multi-Scale Fusion (AMSF) module to simultaneously sample complementary local features scattered in multi-modal and multi-scale layers, and adaptively aggregate them with fine-grained attention to fully exploit different modalities for better multi-scale detection results. Extensive experiments are conducted on three multispectral datasets and three representative deep-learning-based detection benchmarks to show the effectiveness and generalization of the proposed method, and the state-of-the-art detection performance.

源语言英语
主期刊名Pattern Recognition and Computer Vision - 5th Chinese Conference, PRCV 2022, Proceedings
编辑Shiqi Yu, Jianguo Zhang, Zhaoxiang Zhang, Tieniu Tan, Pong C. Yuen, Yike Guo, Junwei Han, Jianhuang Lai
出版商Springer Science and Business Media Deutschland GmbH
382-393
页数12
ISBN(印刷版)9783031189067
DOI
出版状态已出版 - 2022
活动5th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2022 - Shenzhen, 中国
期限: 4 11月 20227 11月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13534 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议5th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2022
国家/地区中国
Shenzhen
时期4/11/227/11/22

指纹

探究 'Attention-Guided Multi-modal and Multi-scale Fusion for Multispectral Pedestrian Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此