Skip to main navigation Skip to search Skip to main content

PGMNet: A Prototype-Guided Multimodal Network for Ship Recognition in SAR Images

  • Liang Chen
  • , Jianhao Li
  • , Honghu Zhong
  • , Hao Shi*
  • , Zhu Yang*
  • , Wei Li
  • *Corresponding author for this work
  • Beijing Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

Ship recognition in synthetic aperture radar (SAR) images has extensive applications across various fields. However, the substantial intraclass variability and interclass similarity present inherent challenges to achieving high-precision recognition. Speckle noise in the background reduces the signal-to-noise ratio, complicating the extraction of discriminative characteristics. In addition, traditional convolutional neural network (CNN)-based methods, which rely solely on image processing frameworks without leveraging additional modality information, struggle to accurately represent the features of SAR targets with bright scatters. To address these issues, we propose a prototype-guided multimodal network (PGMNet), marking a pioneering effort to introduce an image–text multimodal fusion processing paradigm into SAR ship recognition. First, a generation strategy of the prototype and the key area is designed to improve the distinguishability between targets and backgrounds. Besides, a prototype-guided alignment module (PGAM) is implemented to assist the network in characterizing key area information, enhancing intraclass feature consistency. Furthermore, a text feature processing branch is incorporated to precisely describe ship size information and effectively integrate image–text multimodal features, reducing intraclass feature distance while enlarging interclass feature distance. Extensive experiments on the OpenSARShip and FUSARShip datasets demonstrate that the proposed PGMNet achieves state-of-the-art (SOTA) performance. Notably, the accuracy of PGMNet is at least 11% higher than the current SOTA algorithms on the OpenSARShip-VI dataset.

Original languageEnglish
Article number5215517
JournalIEEE Transactions on Geoscience and Remote Sensing
Volume63
DOIs
Publication statusPublished - 2025

Keywords

  • Multimodal feature fusion
  • prototype guidance
  • ship recognition
  • synthetic aperture radar (SAR)

Fingerprint

Dive into the research topics of 'PGMNet: A Prototype-Guided Multimodal Network for Ship Recognition in SAR Images'. Together they form a unique fingerprint.

Cite this