SFSANet: Multiscale Object Detection in Remote Sensing Image Based on Semantic Fusion and Scale Adaptability

Yunzuo Zhang; Ting Liu; Puze Yu; Shuangshuang Wang; Ran Tao

doi:10.1109/TGRS.2024.3387572

SFSANet: Multiscale Object Detection in Remote Sensing Image Based on Semantic Fusion and Scale Adaptability

Yunzuo Zhang^*, Ting Liu, Puze Yu, Shuangshuang Wang, Ran Tao

^*Corresponding author for this work

Institute of Advanced Structure Technology

Shijiazhuang Tiedao University

Research output: Contribution to journal › Article › peer-review

22 Citations (Scopus)

Abstract

In the field of computer vision, remote sensing image object detection plays an important role. Although the object detection algorithm has made significant progress, there are still problems in detecting objects with multiscale in remote sensing image. Due to the insufficient utilization of object feature information, the detection accuracy of multiscale objects is very low. To address the aforementioned issues, this article proposes an effective object detection algorithm for remote sensing image based on semantic fusion and scale adaptability (SFSANet). First, in view of the problem that the existing methods ignore the semantic differences between different scale feature maps, the semantic fusion (SF) module is proposed to enrich the semantic information and improve the ability to classify and locate objects. Next, to address the issue of the objects being easily interfered in complex background and the detection performance is poor, the spatial location attention (SLA) module is constructed to suppress background information and make key objects more prominent. Additionally, the scale adaptability (SA) module is designed to enrich the expression of feature information, realize the integration of global and local information, and ensure the integrity of image structure. Finally, we adopt the SIoU loss function as the localization loss to expedite model convergence. In order to verify the effectiveness of the proposed method, we conduct experiments on the mainstream datasets DIOR and NWPU VHR-10, which fully demonstrate the superiority of the proposed method.

Original language	English
Article number	4406410
Pages (from-to)	1-10
Number of pages	10
Journal	IEEE Transactions on Geoscience and Remote Sensing
Volume	62
DOIs	https://doi.org/10.1109/TGRS.2024.3387572
Publication status	Published - 2024

Keywords

Object detection
receptive field
remote sensing image
semantic fusion (SF)

Access to Document

10.1109/TGRS.2024.3387572

Cite this

@article{3b8a612156b94d4f88cb6352f610f360,

title = "SFSANet: Multiscale Object Detection in Remote Sensing Image Based on Semantic Fusion and Scale Adaptability",

abstract = "In the field of computer vision, remote sensing image object detection plays an important role. Although the object detection algorithm has made significant progress, there are still problems in detecting objects with multiscale in remote sensing image. Due to the insufficient utilization of object feature information, the detection accuracy of multiscale objects is very low. To address the aforementioned issues, this article proposes an effective object detection algorithm for remote sensing image based on semantic fusion and scale adaptability (SFSANet). First, in view of the problem that the existing methods ignore the semantic differences between different scale feature maps, the semantic fusion (SF) module is proposed to enrich the semantic information and improve the ability to classify and locate objects. Next, to address the issue of the objects being easily interfered in complex background and the detection performance is poor, the spatial location attention (SLA) module is constructed to suppress background information and make key objects more prominent. Additionally, the scale adaptability (SA) module is designed to enrich the expression of feature information, realize the integration of global and local information, and ensure the integrity of image structure. Finally, we adopt the SIoU loss function as the localization loss to expedite model convergence. In order to verify the effectiveness of the proposed method, we conduct experiments on the mainstream datasets DIOR and NWPU VHR-10, which fully demonstrate the superiority of the proposed method.",

keywords = "Object detection, receptive field, remote sensing image, semantic fusion (SF)",

author = "Yunzuo Zhang and Ting Liu and Puze Yu and Shuangshuang Wang and Ran Tao",

note = "Publisher Copyright: {\textcopyright} 1980-2012 IEEE.",

year = "2024",

doi = "10.1109/TGRS.2024.3387572",

language = "English",

volume = "62",

pages = "1--10",

journal = "IEEE Transactions on Geoscience and Remote Sensing",

issn = "0196-2892",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - SFSANet

T2 - Multiscale Object Detection in Remote Sensing Image Based on Semantic Fusion and Scale Adaptability

AU - Zhang, Yunzuo

AU - Liu, Ting

AU - Yu, Puze

AU - Wang, Shuangshuang

AU - Tao, Ran

PY - 2024

Y1 - 2024

N2 - In the field of computer vision, remote sensing image object detection plays an important role. Although the object detection algorithm has made significant progress, there are still problems in detecting objects with multiscale in remote sensing image. Due to the insufficient utilization of object feature information, the detection accuracy of multiscale objects is very low. To address the aforementioned issues, this article proposes an effective object detection algorithm for remote sensing image based on semantic fusion and scale adaptability (SFSANet). First, in view of the problem that the existing methods ignore the semantic differences between different scale feature maps, the semantic fusion (SF) module is proposed to enrich the semantic information and improve the ability to classify and locate objects. Next, to address the issue of the objects being easily interfered in complex background and the detection performance is poor, the spatial location attention (SLA) module is constructed to suppress background information and make key objects more prominent. Additionally, the scale adaptability (SA) module is designed to enrich the expression of feature information, realize the integration of global and local information, and ensure the integrity of image structure. Finally, we adopt the SIoU loss function as the localization loss to expedite model convergence. In order to verify the effectiveness of the proposed method, we conduct experiments on the mainstream datasets DIOR and NWPU VHR-10, which fully demonstrate the superiority of the proposed method.

AB - In the field of computer vision, remote sensing image object detection plays an important role. Although the object detection algorithm has made significant progress, there are still problems in detecting objects with multiscale in remote sensing image. Due to the insufficient utilization of object feature information, the detection accuracy of multiscale objects is very low. To address the aforementioned issues, this article proposes an effective object detection algorithm for remote sensing image based on semantic fusion and scale adaptability (SFSANet). First, in view of the problem that the existing methods ignore the semantic differences between different scale feature maps, the semantic fusion (SF) module is proposed to enrich the semantic information and improve the ability to classify and locate objects. Next, to address the issue of the objects being easily interfered in complex background and the detection performance is poor, the spatial location attention (SLA) module is constructed to suppress background information and make key objects more prominent. Additionally, the scale adaptability (SA) module is designed to enrich the expression of feature information, realize the integration of global and local information, and ensure the integrity of image structure. Finally, we adopt the SIoU loss function as the localization loss to expedite model convergence. In order to verify the effectiveness of the proposed method, we conduct experiments on the mainstream datasets DIOR and NWPU VHR-10, which fully demonstrate the superiority of the proposed method.

KW - Object detection

KW - receptive field

KW - remote sensing image

KW - semantic fusion (SF)

UR - http://www.scopus.com/inward/record.url?scp=85190358480&partnerID=8YFLogxK

U2 - 10.1109/TGRS.2024.3387572

DO - 10.1109/TGRS.2024.3387572

M3 - Article

AN - SCOPUS:85190358480

SN - 0196-2892

VL - 62

SP - 1

EP - 10

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

M1 - 4406410

ER -

SFSANet: Multiscale Object Detection in Remote Sensing Image Based on Semantic Fusion and Scale Adaptability

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this