An Infrared and Visible Image Fusion Network based on Two-stream Feature Decomposition

Wei Shi, Tong Liu*, Yi Ning Liu, Yi Ke Li

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Infrared and visible image fusion is an important task in areas such as video surveillance and vehicle navigation. It aims to integrate complementary features of the source images to generate a fused image containing salient targets and rich texture details. However, most existing fusion algorithms ignore the fact that infrared and visible modalities have different representations of features at different frequencies. To address the challenges of cross-modal feature decomposition and fusion, this paper proposes a two-stream feature decomposition network for infrared and visible image fusion, called TFDFusion. Firstly, a scene encoder with Transformer as the architecture extracts low-frequency global scene features, while an attribute encoder extracts high-frequency local attribute features using CNN. Secondly, self-attention and cross-attention fusion modules are used to facilitate feature decomposition and adaptive fusion of complementary features, respectively. In addition, we construct the fusion loss using a contrast mask based on variance computation, which guides the fusion network to retain the high-contrast regions in the source image. Extensive experiments show that our TFDFusion achieves satisfactory results in infrared and visible image fusion tasks, outperforming state-of-the-art methods in terms of visual quality.

Original languageEnglish
Title of host publicationProceedings of the 43rd Chinese Control Conference, CCC 2024
EditorsJing Na, Jian Sun
PublisherIEEE Computer Society
Pages7310-7317
Number of pages8
ISBN (Electronic)9789887581581
DOIs
Publication statusPublished - 2024
Event43rd Chinese Control Conference, CCC 2024 - Kunming, China
Duration: 28 Jul 202431 Jul 2024

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference43rd Chinese Control Conference, CCC 2024
Country/TerritoryChina
CityKunming
Period28/07/2431/07/24

Keywords

  • Deep learning
  • Feature decomposition
  • Image fusion
  • Transformer

Fingerprint

Dive into the research topics of 'An Infrared and Visible Image Fusion Network based on Two-stream Feature Decomposition'. Together they form a unique fingerprint.

Cite this