TY - GEN
T1 - Non-local Architecture Net Semantic Segmentation Network for Remote Sensing Images
AU - Xu, Fengxiang
AU - Xu, Tingfa
AU - Zhang, Nan
AU - Zhao, Wangcai
AU - Chen, Zhenxiang
AU - Li, Xiaohang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Remote sensing images play a vital role in geographic information acquisition and resource management. However, challenges such as complex terrain variations, varying object scales, and diverse lighting and weather conditions often hinder the accuracy of edge segmentation and small-object detection in existing methods. Traditional approaches typically rely on stacking multiple convolutional layers to expand the receptive field, which not only increases computational costs but also results in incomplete edge segmentation and poor recognition of small objects.To address these limitations, this paper introduces a Non-Local U-net (NLUnet) model based on the U-net architecture. NLUnet builds upon U-net's strengths in local feature extraction by integrating a non-local mechanism that captures long-range dependencies, thereby effectively expanding the receptive field. This approach enhances the capture of global contextual information and reduces computational costs, particularly excelling in complex scenes. NLUnet significantly improves the segmentation of small objects and the completeness of edge segmentation.Experimental results on the Aerial Imagery dataset demonstrate the superior performance of NLUnet, achieving an IoU of 93%, an OA of 92%, and an average F1 score of 91%. NLUnet outperforms other methods in small-object segmentation, underscoring its effectiveness in complex remote sensing scenarios.
AB - Remote sensing images play a vital role in geographic information acquisition and resource management. However, challenges such as complex terrain variations, varying object scales, and diverse lighting and weather conditions often hinder the accuracy of edge segmentation and small-object detection in existing methods. Traditional approaches typically rely on stacking multiple convolutional layers to expand the receptive field, which not only increases computational costs but also results in incomplete edge segmentation and poor recognition of small objects.To address these limitations, this paper introduces a Non-Local U-net (NLUnet) model based on the U-net architecture. NLUnet builds upon U-net's strengths in local feature extraction by integrating a non-local mechanism that captures long-range dependencies, thereby effectively expanding the receptive field. This approach enhances the capture of global contextual information and reduces computational costs, particularly excelling in complex scenes. NLUnet significantly improves the segmentation of small objects and the completeness of edge segmentation.Experimental results on the Aerial Imagery dataset demonstrate the superior performance of NLUnet, achieving an IoU of 93%, an OA of 92%, and an average F1 score of 91%. NLUnet outperforms other methods in small-object segmentation, underscoring its effectiveness in complex remote sensing scenarios.
KW - Non-local mechanism
KW - remote sensing image
KW - Semantic segmentation
KW - small scale objects
KW - U-net
UR - http://www.scopus.com/inward/record.url?scp=86000008457&partnerID=8YFLogxK
U2 - 10.1109/ICSIDP62679.2024.10867926
DO - 10.1109/ICSIDP62679.2024.10867926
M3 - Conference contribution
AN - SCOPUS:86000008457
T3 - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
BT - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Y2 - 22 November 2024 through 24 November 2024
ER -