Cross-modal Adaptive Fusion Object Detection Based on Illumination-Awareness

Junwei Xu*, Bo Mo, Jie Zhao, Chunbo Zhao, Yimeng Tao, Shuo Han

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

In recent years, cross-modal object detection has attracted much attention from researchers across various domains. Compared to single-modal detection, cross-modal object detection combines diverse features from distinct modalities, improving the reliability and robustness of object detection applications. However, the fusion methods of existing cross-modal models have deficiencies that make them less suitable for the specific task of aerial object detection. To address these challenges, we present the Cross-Modal Adaptive Fusion Network based on Illumination Awareness (CAFN-IA). The Illumination Awareness module is designed to dynamically adjust the position and orientation of the ground-truth box (GT-Box) by quantifying light intensity and computing Intersection over Union (IoU) metrics derived from RGB and infrared images. Additionally, a dual-stream network architecture is developed to extract RGB and infrared features separately. Moreover, an Interest Region Extraction module enhances the extraction of partial regions. Furthermore, we introduce a Cross-Scale Adaptive Fusion module, which enhances the complementarity of the distinct features generated from RGB and infrared images. Notably, our approach modifies the loss function to improve the accuracy of small object detection. Extensive experiments and thorough ablation studies demonstrate the efficacy of our method, which achieves an accuracy above 69% on the DroneVehicle dataset.
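The two quantities the abstract names for the Illumination Awareness module, light intensity and IoU, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `iou` and `mean_luminance` are hypothetical, the boxes are assumed to be axis-aligned `(x1, y1, x2, y2)` tuples (the abstract also mentions orientation, which this sketch ignores), and the Rec. 601 luma weights are only one common way to quantify brightness.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Assumption: axis-aligned boxes only; oriented boxes would need
    a polygon intersection instead.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap, clamped at zero when disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def mean_luminance(rgb_pixels):
    """Mean Rec. 601 luma in [0, 1] for an iterable of (r, g, b) in 0..255.

    Assumption: a plain average over pixels stands in for whatever
    light-intensity measure the paper actually uses.
    """
    total = 0.0
    count = 0
    for r, g, b in rgb_pixels:
        total += 0.299 * r + 0.587 * g + 0.114 * b
        count += 1
    return total / (255.0 * count) if count else 0.0
```

For example, `iou(rgb_box, ir_box)` would measure how well the annotations of the same object agree across the two modalities, and `mean_luminance` over the RGB frame would give a rough day/night indicator; how CAFN-IA combines such values is described only in the full paper.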

Original language: English
Title of host publication: Proceedings - 2024 39th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2024
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 931-938
Number of pages: 8
ISBN (Electronic): 9798350379228
DOIs: https://doi.org/10.1109/YAC63405.2024.10598791
Publication status: Published - 2024
Event: 39th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2024 - Dalian, China
Duration: 7 Jun 2024 to 9 Jun 2024

Publication series

Name: Proceedings - 2024 39th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2024

Conference

Conference: 39th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2024
Country/Territory: China
City: Dalian
Period: 7/06/24 to 9/06/24

Keywords

  • Adaptive fusion
  • Cross-modal
  • Illumination awareness

Cite this

Xu, J., Mo, B., Zhao, J., Zhao, C., Tao, Y., & Han, S. (2024). Cross-modal Adaptive Fusion Object Detection Based on Illumination-Awareness. In Proceedings - 2024 39th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2024 (pp. 931-938). (Proceedings - 2024 39th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2024). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/YAC63405.2024.10598791