Cross-Modal Attentive Recalibration and Dynamic Fusion for Multispectral Pedestrian Detection

Wei Bao, Jingjing Hu*, Meiyu Huang, Xueshuang Xiang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

1 Citation (Scopus)

Abstract

Multispectral pedestrian detection can provide accurate and reliable results from color-thermal modalities and has drawn much attention. However, how to effectively capture and leverage complementary information from multiple modalities for superior performance is still a core issue. This paper presents a Cross-Modal Attentive Recalibration and Dynamic Fusion Network (CMRF-Net) to adaptively recalibrate and dynamically fuse multi-modal features from multiple perspectives. CMRF-Net consists of a Cross-modal Attentive Feature Recalibration (CAFR) module and a Multi-Modal Dynamic Feature Fusion (MDFF) module in each feature extraction stage. The CAFR module recalibrates features by fully leveraging local and global complementary information in spatial- and channel-wise dimensions, leading to better cross-modal feature alignment and extraction. The MDFF module adopts dynamically learned convolutions to further exploit complementary information in kernel space, enabling more efficient multi-modal feature aggregation. Extensive experiments on three multispectral datasets demonstrate the effectiveness and generalization of the proposed method, as well as its state-of-the-art detection performance. Specifically, CMRF-Net achieves a 2.3% mAP gain over the baseline on the FLIR dataset.
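
The two ideas named in the abstract, cross-modal recalibration and input-dependent fusion, can be illustrated with a toy sketch. This is not the CAFR or MDFF module from the paper: there are no learned parameters, the gate covers only the channel dimension, and the fusion step uses per-channel softmax weights in place of the dynamically learned convolutions the abstract describes. The function names and the list-of-channels feature layout are assumptions made for illustration.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_means(feat):
    """Global average pooling: one scalar per channel.

    `feat` is a list of channels, each a flat list of spatial values
    (a hypothetical layout chosen to keep the sketch dependency-free).
    """
    return [sum(ch) / len(ch) for ch in feat]

def cross_recalibrate(target, source):
    """Rescale each channel of `target` by a gate computed from `source`.

    Assumption: a parameter-free sigmoid gate stands in for a learned
    attention branch.
    """
    gates = [sigmoid(m) for m in channel_means(source)]
    return [[g * v for v in ch] for g, ch in zip(gates, target)]

def dynamic_fuse(rgb, thermal):
    """Fuse two modalities with input-dependent per-channel weights.

    Assumption: a two-way softmax over channel means replaces the
    dynamically learned convolution kernels mentioned in the abstract.
    """
    fused = []
    for ch_r, ch_t in zip(rgb, thermal):
        m_r = sum(ch_r) / len(ch_r)
        m_t = sum(ch_t) / len(ch_t)
        mx = max(m_r, m_t)  # subtract the max for numerical stability
        e_r, e_t = math.exp(m_r - mx), math.exp(m_t - mx)
        w_r = e_r / (e_r + e_t)
        fused.append([w_r * a + (1.0 - w_r) * b for a, b in zip(ch_r, ch_t)])
    return fused

# Toy features: 2 channels, 4 spatial positions each.
rgb = [[1.0, 2.0, 3.0, 4.0], [0.0, 0.0, 0.0, 0.0]]
thermal = [[0.5, 0.5, 0.5, 0.5], [2.0, 2.0, 2.0, 2.0]]

rgb_r = cross_recalibrate(rgb, thermal)      # color gated by thermal
thermal_r = cross_recalibrate(thermal, rgb)  # thermal gated by color
fused = dynamic_fuse(rgb_r, thermal_r)
```

In the second channel the color input is all zeros, so the fusion weight shifts toward the thermal features. This mirrors, in a very reduced form, the abstract's idea of letting the input decide how the modalities are combined.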

Original language: English
Title of host publication: Pattern Recognition and Computer Vision - 6th Chinese Conference, PRCV 2023, Proceedings
Editors: Qingshan Liu, Hanzi Wang, Rongrong Ji, Zhanyu Ma, Weishi Zheng, Hongbin Zha, Xilin Chen, Liang Wang
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 499-510
Number of pages: 12
ISBN (Print): 9789819984282
DOI
Publication status: Published - 2024
Event: 6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023 - Xiamen, China
Duration: 13 Oct 2023 to 15 Oct 2023

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 14425 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 6th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2023
Country/Territory: China
City: Xiamen
Period: 13/10/23 to 15/10/23
