MDFNet: Multimodal Feature Decomposition and Fusion Network for Multimodal Remote Sensing Image Semantic Segmentation

Tianyu Wei, He Chen, Jue Wang*, Wenchao Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The semantic segmentation of multimodal remote sensing (RS) images, utilizing optical and synthetic aperture radar (SAR) data, has raised attention in recent studies. Advanced studies adaptively modeled and fused modality-share and modality-specific information beneficial for segmentation and achieved competitive performance. However, in challenging scenarios such as low image contrast or blurred textures in optical images and speckle noises or foreshortening in SAR images, modality-specific information may lost. This study proposed a multimodal feature decomposition and fusion network (MDFNet) designed for multimodal RS image semantic segmentation using optical and SAR images. By decomposing multimodal features into modality-share and modality-difference features and applying gradient descent on modality-difference features, modality-specific information beneficial for segmentation can be retained. Specifically, we designed a MDF decoder with MDF blocks. MDF block maps multimodal features into the same feature space and calculates the difference of multimodal features to obtain modality-share and modality-difference features respectively, then fuses these features. MDF decoder optimizes modality-difference features by gradient descent using adaptive modeling, thereby retaining modality-specific information that is beneficial for segmentation. Comprehensive experiments conducted on the DFC20 dataset demonstrated that the proposed MDFNet surpasses representative methods.

Original languageEnglish
Title of host publicationIEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331515669
DOIs
Publication statusPublished - 2024
Event2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 - Zhuhai, China
Duration: 22 Nov 202424 Nov 2024

Publication series

NameIEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

Conference

Conference2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Country/TerritoryChina
CityZhuhai
Period22/11/2424/11/24

Keywords

  • Modality-specific information
  • multimodal remote sensing image semantic segmentation
  • remote sensing
  • synthetic aperture radar (SAR)

Fingerprint

Dive into the research topics of 'MDFNet: Multimodal Feature Decomposition and Fusion Network for Multimodal Remote Sensing Image Semantic Segmentation'. Together they form a unique fingerprint.

Cite this