Ship Detection Transformer in SAR Images Based on Key Scattering Points Feature Aggregation and Context Feature Refinement

Yifei Yin, Zhu Yang, Hao Shi*, Fanyu Meng, Wei Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In recent years, deep learning algorithms have demonstrated significant advancements in the field of ship detection using synthetic aperture radar (SAR). Nevertheless, two primary challenges persist in the task of SAR ship detection: 1) Owing to the unique imaging mechanism, targets in SAR images are typically represented by scattering points, posing challenges for accurate feature extraction and leading to issues of inaccurate localization. 2) The detectors are susceptible to generating false alarms due to interference from the complex backgrounds of inshore scenes. In order to mitigate the issues mentioned above, a ship detection transformer based on key scattering points feature aggregation and context feature refinement is proposed. Specifically, considering that the ship targets exist in the form of scattering points in the SAR images, a Key Scattering Points Feature Aggregation Module is designed to mine and aggregate the key scattering points feature of ship targets. By this method, it is beneficial to generate more accurate feature representation for improving the localization performance of the detectors. Furthermore, to address the issue of excessive false alarms under complex background interference, a Context Feature Refinement Module is designed to augment the semantic representation and context information of feature maps. Extensive experiments are conducted on the two public datasets to substantiate the superiority of our proposed detector compared with other state-of-the-art methods.

Keywords

  • Detection transformer
  • scattering points
  • ship detection
  • synthetic aperture radar (SAR)

Fingerprint

Dive into the research topics of 'Ship Detection Transformer in SAR Images Based on Key Scattering Points Feature Aggregation and Context Feature Refinement'. Together they form a unique fingerprint.

Cite this