Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss

Qi Ming, Lingjuan Miao, Zhiqiang Zhou*, Xue Yang, Yunpeng Dong

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

75 Citations (Scopus)

Abstract

Arbitrary-oriented objects exist widely in remote sensing images. The mainstream rotation detectors use oriented bounding boxes (OBBs) or quadrilateral bounding boxes (QBBs) to represent the rotating objects. However, these methods suffer from the representation ambiguity for oriented object definition, which leads to suboptimal regression optimization and the inconsistency between the loss metric and the localization accuracy of the predictions. In this letter, we propose a representation invariance loss (RIL) to optimize the bounding box regression for the rotating objects in the remote sensing images. RIL treats multiple representations of an oriented object as multiple equivalent local minima and hence transforms bounding box regression into an adaptive matching process with these local minima. Next, the Hungarian matching algorithm is adopted to obtain the optimal regression strategy. Besides, we propose a normalized rotation loss to alleviate the weak correlation between different variables and their unbalanced loss contribution in OBB representation. Extensive experiments on remote sensing datasets show that our method achieves consistent and substantial improvement. The code and models are available at https://github.com/ming71/RIDet to facilitate future research.

Original languageEnglish
JournalIEEE Geoscience and Remote Sensing Letters
Volume19
DOIs
Publication statusPublished - 2022

Keywords

  • Bounding box regression
  • convolutional neural networks
  • oriented object detection
  • representation ambiguity

Fingerprint

Dive into the research topics of 'Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss'. Together they form a unique fingerprint.

Cite this