Skip to main navigation Skip to search Skip to main content

Prior-Guided RGB-T Object Detection for UAVs

  • Lingyi Yu
  • , Zhengjie Wang*
  • , Zhaohuan Zhan
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Shenzhen MSU-BIT University

Research output: Contribution to journalArticlepeer-review

Abstract

RGB-T object detection enhances accuracy and robustness in complex environments by fusing complementary information from visible and thermal infrared images. For practical UAV applications, two core challenges arise: ① intra-modality: visible images captured at night or in bad weather suffer severe degradation and detail loss; ② inter-modality: due to varying perspectives, small targets, and complex backgrounds, target information is hard to align during cross-modal fusion, leading to high noise and difficulty in detecting small targets. To address these, a prior-guided improved scheme for UAVs was proposed. To address the intra-modality problem, a pre-trained low-light enhancement prior was used to enhance low-light RGB images in the spatial domain, restoring details. To address the inter-modality problem, a human attention prior was introduced to design a lightweight foreground discrimination branch, which helped the model focus on target regions via multi-task learning, reducing background noise. Experimental results show that the framework achieves robust detection in complex scenarios with varying illumination and multi-scale targets, providing reliable multi-modal detection support for low-altitude intelligent perception.

Translated title of the contribution先验引导的无人机 RGB-T 目标检测
Original languageEnglish
Pages (from-to)470-479
Number of pages10
JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume46
Issue number5
DOIs
Publication statusPublished - 2026
Externally publishedYes

Keywords

  • RGB-T object detection
  • UAV object detection
  • multimodal fusion
  • small object detection

Fingerprint

Dive into the research topics of 'Prior-Guided RGB-T Object Detection for UAVs'. Together they form a unique fingerprint.

Cite this