Image Editing based on Diffusion Model for Remote Sensing Image Change Captioning

Miaoxin Cai, He Chen*, Can Li, Shuyu Gan, Liang Chen, Yin Zhuang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Remote Sensing Image Change Captioning (RSICC) is a task that utilizes natural language to describe changes in remote sensing images of the same area captured at different times. However, the significant temporal intervals between multi-temporal images, the infrequency of observable changes, and the limitations on observation locations make it difficult to acquire and annotate a large and diverse dataset for analyzing change in multi-temporal images. The scarcity of labeled data hinders the training of RSICC models, leading to poor generalization. Compared to annotated registered bi-temporal image, single-temporal data is easier to obtain. Therefore, to tackle the issue of poor generalization of RSICC models under limited annotated sample conditions, a text-guided image pairs generation (TGIPG) method is proposed to create synthetic RSICC datasets from single-temporal data and randomly sampled text instructions via a diffusion-based controllable image editing model. This approach generates more valid pairs of multi-temporal samples to address the constraints of limited change information. Specifically, this method utilizes language instructions to introduce change information into the diffusion process, gradually transforming the pre-phase image into the post-phase image. Our experiments on the LEVIR-CC dataset show that synthetic data can significantly enhance the performance of any RSICC model, with a restricted number of training samples, by employing this plug-and-play TGIPG method.

源语言英语
主期刊名IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331515669
DOI
出版状态已出版 - 2024
活动2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 - Zhuhai, 中国
期限: 22 11月 202424 11月 2024

出版系列

姓名IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

会议

会议2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
国家/地区中国
Zhuhai
时期22/11/2424/11/24

指纹

探究 'Image Editing based on Diffusion Model for Remote Sensing Image Change Captioning' 的科研主题。它们共同构成独一无二的指纹。

引用此

Cai, M., Chen, H., Li, C., Gan, S., Chen, L., & Zhuang, Y. (2024). Image Editing based on Diffusion Model for Remote Sensing Image Change Captioning. 在 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 (IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSIDP62679.2024.10868357