TY - JOUR
T1 - SHFGAN
T2 - SAR Image Harmonization Fusion with Generative Adversarial Network
AU - Yang, Junyu
AU - Wang, Wenzheng
AU - Deng, Chenwei
N1 - Publisher Copyright:
© 2008-2012 IEEE.
PY - 2026
Y1 - 2026
N2 - Although significant progress has been achieved in synthesizing and simulating synthetic aperture radar (SAR) images using generative models, the random generation of target characteristics and target disappearance during the synthesis process remain major challenges due to the unique nature of SAR images. To address these issues while ensuring harmonious fusion of target patches and backgrounds, this article proposes a generative model named SHFGAN (SAR image harmonization fusion generative adversarial network). Built upon a multiconstraint generative adversarial network framework, the generator of SHFGAN employs a vision transformer model. By introducing attention mechanisms to enhance feature interaction and integrating a semantic embedding module to enforce external semantic constraints, the model effectively resolves unnatural target-background fusion and random generation of target characteristics. In addition, a multitask discriminator design, including a realism discriminator and a mask segmentation discriminator, is adopted to respectively constrain the authenticity of generated images and their semantic segmentation results. Through adversarial training and multiconstraint loss functions, the problem of target disappearance is mitigated. Quantitative and qualitative experiments on the SARDet-100K, SSDD, and AIR-SARship datasets demonstrate that SHFGAN achieves remarkable performance in SAR target patch-background fusion tasks, outperforming common generative models and reaching state-of-the-art levels.
AB - Although significant progress has been achieved in synthesizing and simulating synthetic aperture radar (SAR) images using generative models, the random generation of target characteristics and target disappearance during the synthesis process remain major challenges due to the unique nature of SAR images. To address these issues while ensuring harmonious fusion of target patches and backgrounds, this article proposes a generative model named SHFGAN (SAR image harmonization fusion generative adversarial network). Built upon a multiconstraint generative adversarial network framework, the generator of SHFGAN employs a vision transformer model. By introducing attention mechanisms to enhance feature interaction and integrating a semantic embedding module to enforce external semantic constraints, the model effectively resolves unnatural target-background fusion and random generation of target characteristics. In addition, a multitask discriminator design, including a realism discriminator and a mask segmentation discriminator, is adopted to respectively constrain the authenticity of generated images and their semantic segmentation results. Through adversarial training and multiconstraint loss functions, the problem of target disappearance is mitigated. Quantitative and qualitative experiments on the SARDet-100K, SSDD, and AIR-SARship datasets demonstrate that SHFGAN achieves remarkable performance in SAR target patch-background fusion tasks, outperforming common generative models and reaching state-of-the-art levels.
KW - Generative model
KW - SAR image harmonization fusion generative adversarial network (SHFGAN)
KW - harmonious fusion
KW - synthetic aperture radar (SAR) image simulation
UR - https://www.scopus.com/pages/publications/105039035879
U2 - 10.1109/JSTARS.2026.3692656
DO - 10.1109/JSTARS.2026.3692656
M3 - Article
AN - SCOPUS:105039035879
SN - 1939-1404
VL - 19
SP - 17550
EP - 17565
JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
ER -