Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion

Research output: Contribution to journalArticlepeer-review

Abstract

Infrared and visible image fusion (IVIF) integrates the thermal saliency of infrared images (IRs) with the structural details of visible images (VIs) to produce comprehensive scene representations. Existing methods often overemphasize one modality, leading to loss of temperature readability or visual details. To address this, we propose Wave-Cross, a wavelet-based fusion framework. Using the discrete wavelet transform (DWT), IR low-frequency sub-bands encode thermal distribution, while VI high-frequency sub-bands capture textural details. Cross-attention adaptively recombines these sub-bands, suppressing modality-specific noise and balancing complementary features. Additionally, we introduce a Heat-Consistency Loss, which enforces pixel-wise thermal ordering and local energy preservation in a self-supervised manner, ensuring the fused image retains IR interpretability while enhancing VI sharpness. Experiments on the TNO, MSRS, and M3FD datasets demonstrate the effectiveness of the proposed method. Compared with state-of-the-art baselines, Wave-Cross achieves superior performance on objective metrics such as SD, AG, SCD, SF, CC, EN, NABF, and MS-SSIM yielding clearer details and more stable thermal saliency under challenging interference conditions. These results highlight the framework’s potential for practical applications in surveillance, autonomous driving, and fault diagnosis.

Original languageEnglish
Article number321
JournalElectronics (Switzerland)
Volume15
Issue number2
DOIs
Publication statusPublished - Jan 2026

Keywords

  • cross-attention
  • heat-consistency
  • image fusion
  • infrared and visible images
  • wavelet transform

Fingerprint

Dive into the research topics of 'Wave-Cross: Balancing Thermal Saliency and Visual Detail in Infrared–Visible Image Fusion'. Together they form a unique fingerprint.

Cite this