TY - JOUR
T1 - SD-Net
T2 - Spatial Dual Network for Aerial Object Detection
AU - Gao, Yangte
AU - Bi, Fukun
AU - Chen, Liang
AU - Nie, Xiaoyu
N1 - Publisher Copyright:
© 2023, Indian Society of Remote Sensing.
PY - 2023/10
Y1 - 2023/10
N2 - The distribution direction of aerial objects is arbitrary compared to objects in natural images. However, the existing detectors identify and locate the targets by relying on the shared features, which leads to the contradiction of regression and classification tasks. To be specific, the classifier suppresses rotation-sensitive features, while the regressor relies on rotation-variable features. To address the above contradictions, a Spatial Dual Network (SD-Net) is proposed, which consists of two modules: Polarization Dual Pyramid Module (PDPM) and Spatial Coordinate Attention Module (SCAM). In the SCAM module, to be able to capture channel-related features and global spatial features in different directions, an attention module is built with different convolution kernels that slide in both horizontal and vertical directions. In addition, the polarization function in the Polarization Dual Pyramid Module can split features into features suitable for classification and regression tasks for use in the classifier and regressor of the network, enabling more refined detection. The experimental results on three remote sensing datasets (i.e., DOTA, UCAS-AOD, and HRSC2016) demonstrate that the proposed method achieves higher performance on detection tasks while maintaining high efficiency.
AB - The distribution direction of aerial objects is arbitrary compared to objects in natural images. However, the existing detectors identify and locate the targets by relying on the shared features, which leads to the contradiction of regression and classification tasks. To be specific, the classifier suppresses rotation-sensitive features, while the regressor relies on rotation-variable features. To address the above contradictions, a Spatial Dual Network (SD-Net) is proposed, which consists of two modules: Polarization Dual Pyramid Module (PDPM) and Spatial Coordinate Attention Module (SCAM). In the SCAM module, to be able to capture channel-related features and global spatial features in different directions, an attention module is built with different convolution kernels that slide in both horizontal and vertical directions. In addition, the polarization function in the Polarization Dual Pyramid Module can split features into features suitable for classification and regression tasks for use in the classifier and regressor of the network, enabling more refined detection. The experimental results on three remote sensing datasets (i.e., DOTA, UCAS-AOD, and HRSC2016) demonstrate that the proposed method achieves higher performance on detection tasks while maintaining high efficiency.
KW - Arbitrary-oriented detection
KW - Deep learning
KW - Feature consistency
KW - Remote sensing image
UR - http://www.scopus.com/inward/record.url?scp=85169621803&partnerID=8YFLogxK
U2 - 10.1007/s12524-023-01750-9
DO - 10.1007/s12524-023-01750-9
M3 - Article
AN - SCOPUS:85169621803
SN - 0255-660X
VL - 51
SP - 2067
EP - 2076
JO - Journal of the Indian Society of Remote Sensing
JF - Journal of the Indian Society of Remote Sensing
IS - 10
ER -