RSTS-YOLOv5: An Improved Object Detector for Drone-Captured Images

Juan Xiu Liu, Jiachen Li, Ruqian Hao, Yanlong Yang, Jing Ming Zhang, Xiangzhou Wang, Guoming Lu, Ping Zhang, Jing zhang, Yong Liu, Lin Liu, Xingguo Wang, Hao Deng, Dongdong Wang, Xiaohui Du*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Despite the tremendous progress in object detection in recent years, object detection on drone-captured images is still a great challenge because of the large number of small objects that appear densely and obscure each other in drone-captured images. In order to solve the problem of difficult object detection on drone-captured images, we propose a robust and efficient deep learning network RSTS-YOLOv5. We constructed Res swin transformer stage (RSTS) based on Swin-Transformer stage to extract global and contextual information and embedded it in YOLOv5x to explore the position of the transformer-based structure added in the detection network. In addition, we propose a multi-scale data augmentation for object detection on drone-captured images, which can enhance the robustness of the model for different scale objects without introducing additional computations. Experimental results show that our proposed RSTS-YOLOv5 achieves a mAP of 34.72% on the VisDrone test-dev subset and 34.84% on the validation-dev subset. Specifically, RSTS-YOLOv5 generalizes well on various drone-captured scenes, and is extremely competitive in object detection tasks on drone-captured images.

源语言英语
主期刊名Proceedings of 2023 11th China Conference on Command and Control
出版商Springer Science and Business Media Deutschland GmbH
355-366
页数12
ISBN(印刷版)9789819990207
DOI
出版状态已出版 - 2024
已对外发布
活动11th China Conference on Command and Control, C2 2023 - Beijing, 中国
期限: 24 10月 202325 10月 2023

出版系列

姓名Lecture Notes in Electrical Engineering
1124 LNEE
ISSN(印刷版)1876-1100
ISSN(电子版)1876-1119

会议

会议11th China Conference on Command and Control, C2 2023
国家/地区中国
Beijing
时期24/10/2325/10/23

指纹

探究 'RSTS-YOLOv5: An Improved Object Detector for Drone-Captured Images' 的科研主题。它们共同构成独一无二的指纹。

引用此