TY - GEN
T1 - CoT-YOLOv8
T2 - 2023 China Automation Congress, CAC 2023
AU - Wang, Yuhe
AU - Pan, Feng
AU - Li, Zhenxu
AU - Xin, Xiuli
AU - Li, Weixing
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Detecting small targets in aerial images using unmanned aerial vehicles is an important research direction in the field of object detection and a highly challenging task. However, existing object detection methods often suffer from high miss rates and false alarm rates in the task of detecting targets in aerial images. To address this issue, we propose an algorithm called CoT - YOLOv8 to improve small target detection in aerial images. Firstly, we add an additional detection layer to the YOLOv8 algorithm to enhance the detection capability for small target objects. Secondly, we insert multiple Convolutional Block Attention Module (CBAM) into the Backbone network to focus more on useful information, thereby improving the detection capability in complex scenes. Additionally, we replace the standard convolutional network in the Backbone network with a Dynamic Convolution Module (DCN), enabling the model to better adapt to geometric variations of the targets. Finally, we introduce the Contextual Transformer module into the Head network, allowing the model to utilize contextual information to assist in object detection and further improve the detection accuracy. The improved algorithm shows an increase of 7.7%, 7.2 %, and 8.7% in precision (P), recall rate (R), and average precision (IOU-O.5) respectively. This indicates that the CoT - YOLOv8 algorithm has better generalization capability and higher detection accuracy compared to the original YOLOv8 in aerial small target scenarios.
AB - Detecting small targets in aerial images using unmanned aerial vehicles is an important research direction in the field of object detection and a highly challenging task. However, existing object detection methods often suffer from high miss rates and false alarm rates in the task of detecting targets in aerial images. To address this issue, we propose an algorithm called CoT - YOLOv8 to improve small target detection in aerial images. Firstly, we add an additional detection layer to the YOLOv8 algorithm to enhance the detection capability for small target objects. Secondly, we insert multiple Convolutional Block Attention Module (CBAM) into the Backbone network to focus more on useful information, thereby improving the detection capability in complex scenes. Additionally, we replace the standard convolutional network in the Backbone network with a Dynamic Convolution Module (DCN), enabling the model to better adapt to geometric variations of the targets. Finally, we introduce the Contextual Transformer module into the Head network, allowing the model to utilize contextual information to assist in object detection and further improve the detection accuracy. The improved algorithm shows an increase of 7.7%, 7.2 %, and 8.7% in precision (P), recall rate (R), and average precision (IOU-O.5) respectively. This indicates that the CoT - YOLOv8 algorithm has better generalization capability and higher detection accuracy compared to the original YOLOv8 in aerial small target scenarios.
KW - Aerial Images
KW - CBAM
KW - Contextual Transformer
KW - Small Object Detection
KW - YOLOv8
UR - http://www.scopus.com/inward/record.url?scp=85189284440&partnerID=8YFLogxK
U2 - 10.1109/CAC59555.2023.10451989
DO - 10.1109/CAC59555.2023.10451989
M3 - Conference contribution
AN - SCOPUS:85189284440
T3 - Proceedings - 2023 China Automation Congress, CAC 2023
SP - 4943
EP - 4948
BT - Proceedings - 2023 China Automation Congress, CAC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 17 November 2023 through 19 November 2023
ER -