Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss

Qi Ming, Lingjuan Miao, Zhiqiang Zhou*, Xue Yang, Yunpeng Dong

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

64 引用 (Scopus)

摘要

Arbitrary-oriented objects exist widely in remote sensing images. The mainstream rotation detectors use oriented bounding boxes (OBBs) or quadrilateral bounding boxes (QBBs) to represent the rotating objects. However, these methods suffer from the representation ambiguity for oriented object definition, which leads to suboptimal regression optimization and the inconsistency between the loss metric and the localization accuracy of the predictions. In this letter, we propose a representation invariance loss (RIL) to optimize the bounding box regression for the rotating objects in the remote sensing images. RIL treats multiple representations of an oriented object as multiple equivalent local minima and hence transforms bounding box regression into an adaptive matching process with these local minima. Next, the Hungarian matching algorithm is adopted to obtain the optimal regression strategy. Besides, we propose a normalized rotation loss to alleviate the weak correlation between different variables and their unbalanced loss contribution in OBB representation. Extensive experiments on remote sensing datasets show that our method achieves consistent and substantial improvement. The code and models are available at https://github.com/ming71/RIDet to facilitate future research.

源语言英语
期刊IEEE Geoscience and Remote Sensing Letters
19
DOI
出版状态已出版 - 2022

指纹

探究 'Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss' 的科研主题。它们共同构成独一无二的指纹。

引用此