Improved pixel-to-pixel generative method from visible to infrared image

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Thermal infrared (TIR) imaging enables all-weather navigation; however, acquiring large-scale TIR imagery across regions is challenging. To address this challenge, this study proposes an improved cross-modal translation network for generating higher-quality TIR images from RGB inputs. Specifically, paired datasets of RGB and TIR images are first constructed using an image alignment method based on the Radiation-Insensitive Feature Transform (RIFT) algorithm. Subsequently, three improvements are made to the pixel-to-pixel generative adversarial network (pix2pix): a transformer module is integrated into the generator architecture to enhance global feature modeling; the conventional normalization in both encoder and decoder layers is replaced with Adaptive Layer-Instance Normalization (AdaLIN) to mitigate training instability induced by illumination variations and contrast discrepancies; and an infrared-aware multimodal edge loss module is designed to compute the edge loss between generated and real images through multimodal feature fusion and infrared adaptation, with this loss added to the original loss function to guide edge alignment. The quality of the generated images is evaluated using metrics including the Structural Similarity Index Measure (SSIM) and Peak Signal-to-Noise Ratio (PSNR), with comparative analyses against existing conversion networks. Experimental results demonstrate that the proposed method achieves superior performance in preserving image quality, validating the effectiveness of the improved cross-modal conversion framework.
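To make the evaluation and edge-guidance ideas in the abstract concrete, the following is a minimal NumPy sketch, not the authors' implementation. `psnr` follows the standard PSNR definition. `sobel_edge_l1` is a plain Sobel-gradient difference used as a generic stand-in: the paper's infrared-aware multimodal edge loss is not specified in this record, so the function name, the Sobel operator, and the L1 reduction are all assumptions made for illustration.

```python
import numpy as np


def psnr(reference, generated, max_value=255.0):
    """Peak Signal-to-Noise Ratio in dB between two same-shaped images."""
    ref = np.asarray(reference, dtype=np.float64)
    gen = np.asarray(generated, dtype=np.float64)
    if ref.shape != gen.shape:
        raise ValueError("images must have the same shape")
    mse = np.mean((ref - gen) ** 2)
    if mse == 0.0:
        return float("inf")  # identical images
    return float(10.0 * np.log10(max_value ** 2 / mse))


def _sobel_magnitude(image):
    """Gradient magnitude of a 2-D image using 3x3 Sobel kernels (valid region only)."""
    img = np.asarray(image, dtype=np.float64)
    # Horizontal gradient: right column minus left column, weighted 1-2-1.
    gx = (img[:-2, 2:] + 2.0 * img[1:-1, 2:] + img[2:, 2:]) - (
        img[:-2, :-2] + 2.0 * img[1:-1, :-2] + img[2:, :-2]
    )
    # Vertical gradient: bottom row minus top row, weighted 1-2-1.
    gy = (img[2:, :-2] + 2.0 * img[2:, 1:-1] + img[2:, 2:]) - (
        img[:-2, :-2] + 2.0 * img[:-2, 1:-1] + img[:-2, 2:]
    )
    return np.sqrt(gx ** 2 + gy ** 2)


def sobel_edge_l1(reference, generated):
    """Hypothetical stand-in edge loss: mean absolute difference of Sobel magnitudes."""
    return float(np.mean(np.abs(_sobel_magnitude(reference) - _sobel_magnitude(generated))))
```

In a pix2pix-style setup, an edge term of this kind would typically be added to the adversarial and L1 reconstruction losses with a weighting factor; the weight and the exact edge operator used in the paper are not given here.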

Original language: English
Title of host publication: Fifth International Conference on Image Processing and Intelligent Control, IPIC 2025
Editors: Hongying Meng, Raffaele Carli, Luis Gomez Deniz
Publisher: SPIE
ISBN (Electronic): 9781510694439
DOIs
Publication status: Published - 2025
Externally published: Yes
Event: 5th International Conference on Image Processing and Intelligent Control, IPIC 2025 - Qingdao, China
Duration: 9 May 2025 to 11 May 2025

Publication series

Name: Proceedings of SPIE - The International Society for Optical Engineering
Volume: 13782
ISSN (Print): 0277-786X
ISSN (Electronic): 1996-756X

Conference

Conference: 5th International Conference on Image Processing and Intelligent Control, IPIC 2025
Country/Territory: China
City: Qingdao
Period: 9/05/25 to 11/05/25

Keywords

  • Generative Adversarial Network (GAN)
  • Image Alignment
  • Image Translation
  • Scene-Based Navigation
