ADVANCING CONTROLLABLE DIFFUSION MODEL FOR FEW-SHOT OBJECT DETECTION IN OPTICAL REMOTE SENSING IMAGERY

Tong Zhang, Yin Zhuang*, Xinyi Zhang, Guanqun Wang, He Chen, Fukun Bi

*此作品的通讯作者

科研成果: 会议稿件论文同行评审

摘要

Few-shot object detection (FSOD) from optical remote sensing imagery has to detect rare objects given only a few annotated bounding boxes. The limited training data is hard to represent the data distribution of realistic remote sensing scenes, restricting the performance of FSOD. Recently, learning conditional controls for text-to-image diffusion model has achieved great progress, which is capable of precisely generating the controllable yet imaginational images by text prompt and spatially localized input conditions. Accordingly, in this work, we aim to explore the potential of diffusion model and propose a solution for few-shot object detection by controllable data generation. Firstly, draw upon a few annotated objects, their bounding boxes and categories are respectively used as the spatial conditions and text prompts, then employ them into large text-to-image diffusion models for controlled image generation. Secondly, based the generated images, in order to adapt to the scale and orientation variances of remote sensing objects, a data transformation is devised for boosting the robustness of model training. Finally, some experiments were conducted on public remote sensing dataset DIOR, and the results proved its effectiveness.

源语言英语
7600-7603
页数4
DOI
出版状态已出版 - 2024
活动2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024 - Athens, 希腊
期限: 7 7月 202412 7月 2024

会议

会议2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024
国家/地区希腊
Athens
时期7/07/2412/07/24

指纹

探究 'ADVANCING CONTROLLABLE DIFFUSION MODEL FOR FEW-SHOT OBJECT DETECTION IN OPTICAL REMOTE SENSING IMAGERY' 的科研主题。它们共同构成独一无二的指纹。

引用此