Abstract
Drone-based RGBT person detection has garnered significant attention because it combines the spatial flexibility of drones with the around-the-clock availability of RGBT data for continuous information acquisition. However, drone-captured images cover wide areas, which leads to illumination imbalance in RGB images and background clutter in thermal images. Because the quality of the two modalities varies across regions of an RGBT image set, fusing their complementary information becomes harder, which leads to false negatives in advanced drone-based RGBT tiny person detection methods. To address this, a novel Cross-modal Complementary Region-aware framework for drone-based RGBT tiny person Detection (CCRDet) is proposed for effective object detection in poorly illuminated RGB regions and thermally cluttered backgrounds. CCRDet employs the proposed cross-modal region-aware guidance to identify these regions and guide the counterpart modality to enhance valid target features accordingly. It then uses the proposed modality-difference feature gated fusion to carry these valid target features into the fused features while preserving them, thereby strengthening their response after fusion and providing high-quality inputs for the detection head. Extensive experiments on two drone-based RGBT tiny person detection datasets, RGBTDronePerson and VTUAV-det, demonstrate the effectiveness of the proposed method. The code is available at https://github.com/G-pz/CCRDet.
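To make the idea of a difference-driven gate concrete, the following is a minimal NumPy sketch of a generic gated fusion of two feature maps. It is not CCRDet's implementation: the function name `difference_gated_fusion`, the weights `w` and `b` (standing in for a learned 1x1 convolution), and the convex-combination form are all assumptions made for illustration. The actual module is in the authors' repository.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def difference_gated_fusion(rgb_feat, thermal_feat, w, b):
    """Fuse two (C, H, W) feature maps with a gate computed from their difference.

    Hypothetical illustration only: w has shape (C, C) and b has shape (C,),
    standing in for a learned 1x1 convolution. CCRDet's real module may differ.
    """
    diff = rgb_feat - thermal_feat
    # A 1x1 convolution over channels, written as an einsum.
    logits = np.einsum("oc,chw->ohw", w, diff) + b[:, None, None]
    gate = sigmoid(logits)  # values in (0, 1) per channel and location
    # Convex combination: the gate decides which modality dominates locally.
    return gate * rgb_feat + (1.0 - gate) * thermal_feat


# Toy inputs; in practice these would be backbone features of each modality.
rng = np.random.default_rng(0)
rgb = rng.normal(size=(4, 8, 8))
thermal = rng.normal(size=(4, 8, 8))
w = rng.normal(size=(4, 4)) * 0.1
b = np.zeros(4)

fused = difference_gated_fusion(rgb, thermal, w, b)
```

Because the gate lies strictly between 0 and 1, each fused value stays between the corresponding RGB and thermal values; with zero weights the gate is 0.5 everywhere and the fusion reduces to a plain average.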
| Original language | English |
|---|---|
| Article number | 104408 |
| Journal | Information Fusion |
| Volume | 135 |
| DOI | |
| Publication status | Published - Nov 2026 |
Fingerprint
Dive into the research topics of 'CCRDet: Cross-modal complementary region-aware framework for drone-based RGBT tiny person detection'. Together they form a unique fingerprint.