Abstract
In response to the multiple challenges faced in detecting small infrared moving targets - such as cluttered backgrounds, limited object size, weak feature representation, and low detection precision - this study introduces an enhanced detection model tailored for UAV infrared imagery. The proposed approach, named IFMO-YOLOv11, is built upon the YOLOv11 architecture and specifically optimized to improve the recognition of small infrared objects in aerial scenarios. Firstly, this model utilizes dilated convolution to design the multi-layer feature dilated convolution module (MLF-DC), which replaces the original SPPF layer to enhance the extraction of detailed features in UAV images. Secondly, to strengthen the C3K2 structure, the RFCBAMConv module is incorporated, refining internal convolutional mechanisms and feature integration, thereby boosting the model's feature extraction performance. In addition, the integration of the Biformer module allows the network to better attend to critical details of small targets. To further enhance localization precision, an improved bounding box regression loss function - power-WIoU (P-WIoU), based on the upgraded Wise-IoU v3 - is introduced for more accurate prediction box positioning. Through rigorous evaluation and testing on a publicly available dataset with wide recognition, extensive experimental results demonstrate that the proposed model achieves superior overall performance compared to other popular approaches. Notably, it offers a marked enhancement in detecting small, moving infrared targets.
| Original language | English |
|---|---|
| Pages (from-to) | 54-61 |
| Number of pages | 8 |
| Journal | Youth Academic Annual Conference of Chinese Association of Automation, YAC |
| Issue number | 2025 |
| DOIs | |
| Publication status | Published - 2025 |
| Externally published | Yes |
| Event | 40th Youth Academic Annual Conference of Chinese Association of Automation, YAC 2025 - Zhengzhou, China Duration: 17 May 2025 → 19 May 2025 |
Keywords
- Biformer
- dilated Conv
- object detection
- RFCBAMConv
- WIoU
- YOLOv11
Fingerprint
Dive into the research topics of 'IFMO-YOLOv11: An Infrared Moving Small Object Detection Model Based on YOLOv11'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver