跳到主要导航 跳到搜索 跳到主要内容

A Dual-Branch Network Based on ViT and Mamba for Semantic Segmentation of Remote Sensing Image

  • Ke An
  • , Ying Wang
  • , Liang Chen
  • , Yupie Wang*
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • China Aerospace Science and Technology Corporation

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Semantic segmentation of remote sensing images has significant applications across various scenarios. The prevailing frameworks include Convolutional Neural Network (CNN) and Transformer. However, CNN is limited by the receptive field of convolutions, while the Transformer is constrained by computational complexity, which restricts attention calculations to local windows and fails to effectively address long-range dependency modeling. The efficient Mamba architecture, characterized by linear complexity, offers a promising solution to these challenges. Inspired by Mamba, we propose a dual-branch network based on ViT and Mamba. The Vision Transformer (ViT) branch incorporates the Swin Transformer to model spatial details while maintaining computational complexity within acceptable bounds. Complementarily, the Mamba branch efficiently captures global context and long-range dependencies. Additionally, to suppress noise and conflicting information arising from the fusion of features from different frameworks, we design the Cross-Model Fusion Module (CMFM) and the Cross-Model Relevance Loss (CMRLoss) to achieve semantic consistency in the fusion process. The comprehensive experimental findings on the commonly utilized GaoFen-2 and iSAID datasets clearly illustrate the advantages of our proposed approach compared to the leading methods in the field.

源语言英语
主期刊名IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798331515669
DOI
出版状态已出版 - 2024
活动2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 - Zhuhai, 中国
期限: 22 11月 202424 11月 2024

出版系列

姓名IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

会议

会议2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
国家/地区中国
Zhuhai
时期22/11/2424/11/24

指纹

探究 'A Dual-Branch Network Based on ViT and Mamba for Semantic Segmentation of Remote Sensing Image' 的科研主题。它们共同构成独一无二的指纹。

引用此