Improved pixel-to-pixel generative method from visible to infrared image

  • Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

Thermal infrared (TIR) imaging enables all-weather navigation, but acquiring large-scale TIR imagery across regions remains challenging. To address this, the study proposes an improved cross-modal translation network that generates higher-quality TIR images from RGB inputs. First, paired RGB and TIR datasets are constructed using an image alignment method based on the Radiation-Insensitive Feature Transform (RIFT) algorithm. The pixel-to-pixel generative adversarial network (pix2pix) is then improved in three ways: (1) a transformer module is integrated into the generator architecture to strengthen global feature modeling; (2) the conventional normalization in both the encoder and decoder layers is replaced with Adaptive Layer-Instance Normalization (AdaLIN) to mitigate training instability induced by illumination variations and contrast discrepancies; and (3) an infrared-aware multimodal edge loss module, which computes the edge loss between generated and real images through multimodal feature fusion and an infrared adaptation design, is incorporated into the original loss function to guide edge alignment. The quality of the generated images is evaluated with metrics including the Structural Similarity Index Measure (SSIM) and Peak Signal-to-Noise Ratio (PSNR), and compared against existing translation networks. Experimental and numerical results show that the proposed method preserves image quality better than the compared networks, supporting the effectiveness of the improved cross-modal translation framework.
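As a rough illustration of two ideas named in the abstract, the sketch below computes PSNR between a real and a generated image and a simple edge-difference term. The abstract does not specify the infrared-aware multimodal edge loss, so a plain Sobel-gradient L1 difference is used here as a stand-in. The function names (`psnr`, `sobel_magnitude`, `edge_l1_loss`) and the nested-list image representation are assumptions made for this example, not the authors' implementation.

```python
import math

def psnr(reference, generated, max_value=255.0):
    """Peak Signal-to-Noise Ratio (dB) between two equally sized grayscale images."""
    total = 0.0
    count = 0
    for ref_row, gen_row in zip(reference, generated):
        for r, g in zip(ref_row, gen_row):
            total += (r - g) ** 2
            count += 1
    mse = total / count
    if mse == 0:
        return math.inf  # identical images: no noise
    return 10.0 * math.log10(max_value ** 2 / mse)

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
SOBEL_Y = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))

def sobel_magnitude(image):
    """Gradient magnitude at interior pixels (the one-pixel border is skipped)."""
    height, width = len(image), len(image[0])
    out = []
    for y in range(1, height - 1):
        row = []
        for x in range(1, width - 1):
            gx = gy = 0.0
            for dy in range(3):
                for dx in range(3):
                    p = image[y + dy - 1][x + dx - 1]
                    gx += SOBEL_X[dy][dx] * p
                    gy += SOBEL_Y[dy][dx] * p
            row.append(math.hypot(gx, gy))
        out.append(row)
    return out

def edge_l1_loss(real, fake):
    """Mean absolute difference between the Sobel edge maps of two images.

    Stand-in for the paper's edge loss, whose exact form is not given in the abstract.
    """
    e_real = sobel_magnitude(real)
    e_fake = sobel_magnitude(fake)
    diffs = [abs(a - b)
             for row_r, row_f in zip(e_real, e_fake)
             for a, b in zip(row_r, row_f)]
    return sum(diffs) / len(diffs)

# Toy 4x4 images: a vertical step edge versus a flat image.
step = [[0, 0, 255, 255] for _ in range(4)]
flat = [[0, 0, 0, 0] for _ in range(4)]
print(psnr(step, step))          # identical images give infinite PSNR
print(edge_l1_loss(step, flat))  # non-zero: the flat image has no edges
```

In a pix2pix-style training loop, a term like `edge_l1_loss` would typically be added to the adversarial and L1 reconstruction losses with a weighting coefficient. The weight, and how the paper fuses multimodal features before computing edges, are not stated in the abstract.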

Original language: English
Title of host publication: Fifth International Conference on Image Processing and Intelligent Control, IPIC 2025
Editors: Hongying Meng, Raffaele Carli, Luis Gomez Deniz
Publisher: SPIE
ISBN (Electronic): 9781510694439
DOI
Publication status: Published - 2025
Externally published
Event: 5th International Conference on Image Processing and Intelligent Control, IPIC 2025 - Qingdao, China
Duration: 9 May 2025 to 11 May 2025

Publication series

Name: Proceedings of SPIE - The International Society for Optical Engineering
Volume: 13782
ISSN (Print): 0277-786X
ISSN (Electronic): 1996-756X

Conference

Conference: 5th International Conference on Image Processing and Intelligent Control, IPIC 2025
Country/Territory: China
City: Qingdao
Period: 9/05/25 to 11/05/25
