Attention-Guided Multi-modal and Multi-scale Fusion for Multispectral Pedestrian Detection

Wei Bao, Meiyu Huang*, Jingjing Hu, Xueshuang Xiang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Multispectral pedestrian detection provides more accurate and reliable detection results by leveraging complementary information from color-thermal modalities and has drawn much attention in the open world. Much progress has been made in the feature-level-based detection methods which aim to effectively fuse the multispectral features extracted by the convolution neural networks. However, existing methods mainly focus on the information integration between the same-level feature maps and ignore the complementary local features scattered in multi-scale layers. In this paper, we introduce an Attention-guided multi-Modal and multi-Scale Fusion (AMSF) module to simultaneously sample complementary local features scattered in multi-modal and multi-scale layers, and adaptively aggregate them with fine-grained attention to fully exploit different modalities for better multi-scale detection results. Extensive experiments are conducted on three multispectral datasets and three representative deep-learning-based detection benchmarks to show the effectiveness and generalization of the proposed method, and the state-of-the-art detection performance.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - 5th Chinese Conference, PRCV 2022, Proceedings
EditorsShiqi Yu, Jianguo Zhang, Zhaoxiang Zhang, Tieniu Tan, Pong C. Yuen, Yike Guo, Junwei Han, Jianhuang Lai
PublisherSpringer Science and Business Media Deutschland GmbH
Pages382-393
Number of pages12
ISBN (Print)9783031189067
DOIs
Publication statusPublished - 2022
Event5th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2022 - Shenzhen, China
Duration: 4 Nov 20227 Nov 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13534 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2022
Country/TerritoryChina
CityShenzhen
Period4/11/227/11/22

Keywords

  • Fine-grained attention
  • Multi-modal and Multi-scale fusion
  • Multispectral pedestrian detection

Fingerprint

Dive into the research topics of 'Attention-Guided Multi-modal and Multi-scale Fusion for Multispectral Pedestrian Detection'. Together they form a unique fingerprint.

Cite this