Abstract
Object detection is a challenging task due to the large diversity of object scales. Oblique images captured by unmanned aerial vehicles (UAVs) exhibit a distinct scale distribution: objects near the top of the image appear at smaller scales, while objects near the bottom appear at larger scales. Exploiting this prior, we propose an object detector with a divide-and-conquer strategy. First, we estimate the object scale using inertial measurement unit (IMU) information. Then, the small objects at the top of the image are detected by shallow networks with small receptive fields, and the large objects at the bottom of the image are detected by deep networks with large receptive fields. Compared with YOLOv5, our method improves accuracy by 4.6% in mean average precision (mAP) and improves speed by 79%. Our method also runs in real time on an NVIDIA XAVIER NX at about 30 frames/s. The code is available on GitHub (https://github.com/bitshenwenxiao/UAVYOLO).
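A minimal sketch of the routing idea described in the abstract, assuming a flat-ground pinhole camera model: the IMU pitch angle is mapped to an image row that splits the frame, the top crop is passed to a shallow small-receptive-field detector, the bottom crop to a deep large-receptive-field detector, and the detections are merged. All function names, parameters, and the geometric approximation are illustrative assumptions, not the released UAVYOLO implementation.

```python
import numpy as np


def estimate_split_row(pitch_deg, vfov_deg, img_h, near_angle_deg=20.0):
    """Approximate the row separating far (small) objects at the top of the
    image from near (large) objects at the bottom.

    pitch_deg: camera depression angle below the horizon, from the IMU.
    vfov_deg:  vertical field of view of the camera.
    near_angle_deg: hypothetical depression-angle threshold beyond which
                    objects are treated as "near".
    Uses a small-angle approximation: the depression angle of row y is
    roughly pitch + (y - img_h / 2) / img_h * vfov.
    """
    split = img_h / 2 + img_h * (near_angle_deg - pitch_deg) / vfov_deg
    return int(np.clip(split, 0, img_h))


def detect_divide_and_conquer(image, pitch_deg, vfov_deg,
                              shallow_detector, deep_detector, overlap=32):
    """Route the top crop to the shallow head and the bottom crop to the
    deep head, then merge boxes in full-image coordinates.

    `shallow_detector` and `deep_detector` are stand-ins for the two
    sub-networks; each takes an HxWx3 array and returns a list of
    (x1, y1, x2, y2, score) boxes in crop coordinates.
    """
    img_h = image.shape[0]
    split = estimate_split_row(pitch_deg, vfov_deg, img_h)

    top_crop = image[: min(img_h, split + overlap)]
    bottom_start = max(0, split - overlap)
    bottom_crop = image[bottom_start:]

    boxes = list(shallow_detector(top_crop))
    # Shift bottom-crop detections back to full-image coordinates.
    boxes += [(x1, y1 + bottom_start, x2, y2 + bottom_start, s)
              for (x1, y1, x2, y2, s) in deep_detector(bottom_crop)]
    return boxes


if __name__ == "__main__":
    # Dummy detectors standing in for the shallow and deep sub-networks.
    dummy_small = lambda crop: [(10, 10, 20, 20, 0.9)]
    dummy_large = lambda crop: [(100, 50, 300, 200, 0.8)]
    frame = np.zeros((1080, 1920, 3), dtype=np.uint8)
    dets = detect_divide_and_conquer(frame, pitch_deg=30.0, vfov_deg=60.0,
                                     shallow_detector=dummy_small,
                                     deep_detector=dummy_large)
    print(dets)
```

The overlap margin around the split row is one possible way to avoid cutting objects in half at the boundary; the paper's actual cropping and merging rules may differ.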
Original language | English |
---|---|
Journal | IEEE Geoscience and Remote Sensing Letters |
Volume | 19 |
DOIs | |
Publication status | Published - 2022 |
Keywords
- Automobiles
- Cameras
- Decoding
- Detectors
- Feature extraction
- Object detection
- Task analysis