TY - JOUR
T1 - SFSANet
T2 - Multiscale Object Detection in Remote Sensing Image Based on Semantic Fusion and Scale Adaptability
AU - Zhang, Yunzuo
AU - Liu, Ting
AU - Yu, Puze
AU - Wang, Shuangshuang
AU - Tao, Ran
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - In the field of computer vision, remote sensing image object detection plays an important role. Although the object detection algorithm has made significant progress, there are still problems in detecting objects with multiscale in remote sensing image. Due to the insufficient utilization of object feature information, the detection accuracy of multiscale objects is very low. To address the aforementioned issues, this article proposes an effective object detection algorithm for remote sensing image based on semantic fusion and scale adaptability (SFSANet). First, in view of the problem that the existing methods ignore the semantic differences between different scale feature maps, the semantic fusion (SF) module is proposed to enrich the semantic information and improve the ability to classify and locate objects. Next, to address the issue of the objects being easily interfered in complex background and the detection performance is poor, the spatial location attention (SLA) module is constructed to suppress background information and make key objects more prominent. Additionally, the scale adaptability (SA) module is designed to enrich the expression of feature information, realize the integration of global and local information, and ensure the integrity of image structure. Finally, we adopt the SIoU loss function as the localization loss to expedite model convergence. In order to verify the effectiveness of the proposed method, we conduct experiments on the mainstream datasets DIOR and NWPU VHR-10, which fully demonstrate the superiority of the proposed method.
AB - In the field of computer vision, remote sensing image object detection plays an important role. Although the object detection algorithm has made significant progress, there are still problems in detecting objects with multiscale in remote sensing image. Due to the insufficient utilization of object feature information, the detection accuracy of multiscale objects is very low. To address the aforementioned issues, this article proposes an effective object detection algorithm for remote sensing image based on semantic fusion and scale adaptability (SFSANet). First, in view of the problem that the existing methods ignore the semantic differences between different scale feature maps, the semantic fusion (SF) module is proposed to enrich the semantic information and improve the ability to classify and locate objects. Next, to address the issue of the objects being easily interfered in complex background and the detection performance is poor, the spatial location attention (SLA) module is constructed to suppress background information and make key objects more prominent. Additionally, the scale adaptability (SA) module is designed to enrich the expression of feature information, realize the integration of global and local information, and ensure the integrity of image structure. Finally, we adopt the SIoU loss function as the localization loss to expedite model convergence. In order to verify the effectiveness of the proposed method, we conduct experiments on the mainstream datasets DIOR and NWPU VHR-10, which fully demonstrate the superiority of the proposed method.
KW - Object detection
KW - receptive field
KW - remote sensing image
KW - semantic fusion (SF)
UR - http://www.scopus.com/inward/record.url?scp=85190358480&partnerID=8YFLogxK
U2 - 10.1109/TGRS.2024.3387572
DO - 10.1109/TGRS.2024.3387572
M3 - Article
AN - SCOPUS:85190358480
SN - 0196-2892
VL - 62
SP - 1
EP - 10
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 4406410
ER -