TY - JOUR
T1 - Toward Efficient Object Detection in Aerial Images Using Extreme Scale Metric Learning
AU - Jin, Ren
AU - Lv, Junning
AU - Li, Bin
AU - Ye, Jianchuan
AU - Lin, Defu
N1 - Publisher Copyright: © 2013 IEEE.
PY - 2021
Y1 - 2021
N2 - In aerial image object detection, an important problem is how to efficiently detect objects of different sizes in input images of different scales while obtaining a unified multi-scale representation of each object. Existing methods rarely consider the connection between multi-scale training and multi-scale inference, and they do not adequately optimize the constraints on input object samples during multi-scale training, which limits the performance of the multi-scale representation. In this study, an efficient object detection algorithm for aerial images is proposed to alleviate this problem. First, we use metric learning to obtain the scale representation boundary of each object class, reduce the support of indistinguishable objects at extreme scales during training, and enhance the multi-scale representation. Second, indistinguishable small objects are merged into small-object regions, and these regions are learned during training so that they direct the detector to small objects at the following high-resolution scale. Thus, a reasonable association between multi-scale training and inference is established, and the efficiency of multi-scale inference is considerably improved. The proposed algorithm has been tested on three popular aerial image datasets: VisDrone, DOTA, and UAVDT. Experimental results show that it improves detection accuracy and reduces the number of pixels processed.
KW - Aerial images
KW - extreme scale
KW - metric learning
KW - object detection
UR - http://www.scopus.com/inward/record.url?scp=85104205813&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2021.3072067
DO - 10.1109/ACCESS.2021.3072067
M3 - Article
AN - SCOPUS:85104205813
SN - 2169-3536
VL - 9
SP - 56214
EP - 56227
JO - IEEE Access
JF - IEEE Access
M1 - 9399436
ER -