TranSTD: A Wavelet-Driven Transformer-Based SAR Target Detection Framework With Adaptive Feature Enhancement and Fusion

  • Bobo Xi
  • , Jiaqi Chen
  • , Yan Huang
  • , Jiaojiao Li
  • , Yunsong Li*
  • , Zan Li*
  • , Xiang Gen Xia
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Target detection in Synthetic Aperture Radar (SAR) images is of great importance in civilian monitoring and military reconnaissance. However, the unique speckle noise inherent in SAR images leads to semantic information loss, while traditional convolutional neural network downsampling methods exacerbate this issue, impacting detection accuracy and robustness. Moreover, some dense target scenarios and weak scattering features of targets make it challenging to achieve sufficient feature discriminability, adding complexity to the detection task. In addition, the multiscale characteristic of SAR targets presents difficulties in balancing detection performance with computational efficiency in complex scenes. To tackle these difficulties, this article introduces a wavelet-driven transformer-based SAR target detection framework called TranSTD. Specifically, it incorporates the Haar wavelet dynamic downsampling and semantic preserving dynamic downsampling modules, which effectively suppress noise and preserve semantic information using techniques such as Haar wavelet denoise and input-driven dynamic pooling downsampling. Furthermore, the SAR adaptive convolution (SAC) bottleneck is proposed for enhancing the discrimination of features. To optimize performance and efficiency across varying scene complexities, a multiscale SAR attention fusion encoder is developed. Extensive experiments are carried out on three datasets, showing that our proposed algorithm outperforms the current state-of-the-art benchmarks in SAR target detection, offering a robust solution for the detection of targets in complex SAR scenes.

Original languageEnglish
Pages (from-to)1197-1211
Number of pages15
JournalIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Volume19
DOIs
Publication statusPublished - 2026
Externally publishedYes

Keywords

  • Dynamic downsampling
  • multiscale SAR attention fusion encoder (MSAF)
  • target detection
  • wavelet denoise

Fingerprint

Dive into the research topics of 'TranSTD: A Wavelet-Driven Transformer-Based SAR Target Detection Framework With Adaptive Feature Enhancement and Fusion'. Together they form a unique fingerprint.

Cite this