CGMRF: CLIP-guide multimodal registration and fusion for a visible–infrared dual-modality imaging system

Wenqu Zhao, Lingxue Wang*, Lian Zhang, Dezhi Zheng, Yi Cai

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In this Letter, we propose CLIP-guided multimodal registration and fusion (CGMRF), a semantic understanding-based multimodal image fusion system, for visible and infrared (IR) dual-modality imaging. CGMRF leverages semantic similarity, better aligned with human visual interpretation, to address the challenges of multimodal image registration and fusion. Experimental results across multiple metrics demonstrate the advantages of the proposed CGMRF system.

Original languageEnglish
Pages (from-to)3907-3910
Number of pages4
JournalOptics Letters
Volume50
Issue number12
DOIs
Publication statusPublished - 15 Jun 2025

Fingerprint

Dive into the research topics of 'CGMRF: CLIP-guide multimodal registration and fusion for a visible–infrared dual-modality imaging system'. Together they form a unique fingerprint.

Cite this