TY - JOUR
T1 - GLDet
T2 - Real-Time SAR Ship Detector Based on Global Semantic Information Enhancement and Local Gradient Information Mining
AU - Chang, Hao
AU - Fu, Xiongjun
AU - Lang, Ping
AU - Guo, Kunyi
AU - Dong, Jian
AU - Chang, Shibo
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Detecting ships in synthetic aperture radar (SAR) images is a challenging task due to various factors, such as the diverse distribution of ships and the intricate nature of SAR images. In recent years, deep learning has made excellent progress in the field of SAR interpretation. Models that focus on extracting global semantic information can effectively achieve balanced detection of multiscale SAR targets, but their computational complexity is relatively high. Models that focus on processing local information have redundant calculations and poor robustness, but are prone to mistaking the background information of SAR images for targets. To address the above issues, we propose a real-time SAR ship detector based on global semantic information enhancement and local gradient information mining. The lightweight feature extraction backbone based on linear computing is designed, with the network structure of global information augmentation encoder (GIAE), local gradient information miner (LGIM), and decoder, which can quickly perform feature extraction. GIAE enhances the expression of image content through the long-sequence modeling capability of the state-space model (SSM). LGIM uses gradient modules composed of depthwise separable convolutions to extract local information of image and utilizes directed self-attention (DSA) to mine channel context information. Global-local detector (GLDet) can complete object detection, rotated object detection, and instance segmentation tasks by transforming the detection head. Excellent performance has been achieved on the SAR ship instance segmentation datasets: SAR ship detection dataset (SSDD) and high-resolution SAR image dataset (HRSID), as well as the SAR rotated ship datasets: rotated ship detection dataset (RSDD)-SAR and SSDD+. Meanwhile, GLDet demonstrated excellent generalization performance in large-scale SAR images captured by GF-3 and Terra-SAR satellites.
AB - Detecting ships in synthetic aperture radar (SAR) images is a challenging task due to various factors, such as the diverse distribution of ships and the intricate nature of SAR images. In recent years, deep learning has made excellent progress in the field of SAR interpretation. Models that focus on extracting global semantic information can effectively achieve balanced detection of multiscale SAR targets, but their computational complexity is relatively high. Models that focus on processing local information have redundant calculations and poor robustness, but are prone to mistaking the background information of SAR images for targets. To address the above issues, we propose a real-time SAR ship detector based on global semantic information enhancement and local gradient information mining. The lightweight feature extraction backbone based on linear computing is designed, with the network structure of global information augmentation encoder (GIAE), local gradient information miner (LGIM), and decoder, which can quickly perform feature extraction. GIAE enhances the expression of image content through the long-sequence modeling capability of the state-space model (SSM). LGIM uses gradient modules composed of depthwise separable convolutions to extract local information of image and utilizes directed self-attention (DSA) to mine channel context information. Global-local detector (GLDet) can complete object detection, rotated object detection, and instance segmentation tasks by transforming the detection head. Excellent performance has been achieved on the SAR ship instance segmentation datasets: SAR ship detection dataset (SSDD) and high-resolution SAR image dataset (HRSID), as well as the SAR rotated ship datasets: rotated ship detection dataset (RSDD)-SAR and SSDD+. Meanwhile, GLDet demonstrated excellent generalization performance in large-scale SAR images captured by GF-3 and Terra-SAR satellites.
KW - Instance segmentation
KW - lightweight network
KW - multitask learning
KW - real-time detection
KW - rotated object detection
KW - ship detection
KW - synthetic aperture radar (SAR)
UR - http://www.scopus.com/inward/record.url?scp=105003680769&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2025.3559551
DO - 10.1109/TGRS.2025.3559551
M3 - Article
AN - SCOPUS:105003680769
SN - 0196-2892
VL - 63
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5209020
ER -