TY - GEN
T1 - Advances in UAV Object Detection
T2 - 2024 IEEE International Conference on Unmanned Systems, ICUS 2024
AU - Zhang, Xu
AU - Yang, Chengwei
AU - Tu, Yuanfang
AU - Zhang, Sheng
AU - Liu, Chang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In the Unmanned Aerial Vehicle (UAV) object detection task, small objects constitute a significant portion and are set against a complex environmental background, making feature extraction challenging. Classical detection algorithms exhibit limitations in accurately detecting small objects and fail to meet the accuracy requirements of small objects detection. Therefore, an improved YOLOv8 model for small object detection is proposed. Firstly, this paper introduces a spatial depth conversion convolution module to minimize the loss of small object information typically seen with conventional strided convolutions. This modification helps mitigate feature loss that could otherwise degrade detection performance. Additionally, this paper incorporates an attention mechanism into the backbone network to better concentrate on small objects, thereby improving detection and localization performance. To further optimize performance, a new bounding box regression loss Wise-IoU (WIoU) v3 is adopted. This choice not only enhances the model's generalization capability but also expedites its convergence. Moreover, this paper has designed a new spatial pyramid pooling layer that preserves the texture features of objects while maintaining operational speed. Experimental results are promising, showing that the new method achieves a 47.1% mAP50, which represents a 14.3% improvement over the baseline model. Importantly, these advancements are achieved with only a minimal increase in parameter count.
AB - In the Unmanned Aerial Vehicle (UAV) object detection task, small objects constitute a significant portion and are set against a complex environmental background, making feature extraction challenging. Classical detection algorithms exhibit limitations in accurately detecting small objects and fail to meet the accuracy requirements of small objects detection. Therefore, an improved YOLOv8 model for small object detection is proposed. Firstly, this paper introduces a spatial depth conversion convolution module to minimize the loss of small object information typically seen with conventional strided convolutions. This modification helps mitigate feature loss that could otherwise degrade detection performance. Additionally, this paper incorporates an attention mechanism into the backbone network to better concentrate on small objects, thereby improving detection and localization performance. To further optimize performance, a new bounding box regression loss Wise-IoU (WIoU) v3 is adopted. This choice not only enhances the model's generalization capability but also expedites its convergence. Moreover, this paper has designed a new spatial pyramid pooling layer that preserves the texture features of objects while maintaining operational speed. Experimental results are promising, showing that the new method achieves a 47.1% mAP50, which represents a 14.3% improvement over the baseline model. Importantly, these advancements are achieved with only a minimal increase in parameter count.
KW - Small-Object detection
KW - UAVS
KW - YOLOv8
UR - http://www.scopus.com/inward/record.url?scp=85218053941&partnerID=8YFLogxK
U2 - 10.1109/ICUS61736.2024.10840022
DO - 10.1109/ICUS61736.2024.10840022
M3 - Conference contribution
AN - SCOPUS:85218053941
T3 - Proceedings of 2024 IEEE International Conference on Unmanned Systems, ICUS 2024
SP - 334
EP - 341
BT - Proceedings of 2024 IEEE International Conference on Unmanned Systems, ICUS 2024
A2 - Song, Rong
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 October 2024 through 20 October 2024
ER -