TY - GEN
T1 - Adaptive Spatial Modeling and Multi-Scale Attention Aggregation for Semantic Segmentation of Remote Sensing Images
AU - Yang, Wenying
AU - Zhang, Yuchuan
AU - Jin, Yuxuan
AU - Wang, Yupei
AU - Chen, Liang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - As remote sensing technology advances and high-resolution sensors are deployed, the analysis of high-resolution remote sensing images encounters challenging issues, such as intra-class variability, and inter-class similarity, due to the unobvious appearance and complicated background. The high-resolution remote sensing images pose critical issues for precise semantic segmentation due to these fundamental issues. To this end, we propose an adaptive hierarchical aggregation network. Different from the previous approach of modeling semantic interdependencies solely within a single feature map in the channel dimension, we introduce multi-scale channel-wise feature aggregation to capture global contextual dependencies. Meanwhile, we design an adaptive detail-aware module to model long-range spatial dependencies, adaptively extracting effective details from different levels as compensation. Additionally, this module utilizes depth-wise separable convolution to selectively filter out irrelevant spatial details from lower-level feature maps. The outputs of the two modules are fused layer by layer to achieve refined feature representation, thereby generating precise pixel-level segmentation results. Our experiments are conducted on the Vaihingen dataset and the Potsdam dataset. The experimental results demonstrate that the proposed algorithm surpasses several existing state-of-the-art approaches.
AB - As remote sensing technology advances and high-resolution sensors are deployed, the analysis of high-resolution remote sensing images encounters challenging issues, such as intra-class variability, and inter-class similarity, due to the unobvious appearance and complicated background. The high-resolution remote sensing images pose critical issues for precise semantic segmentation due to these fundamental issues. To this end, we propose an adaptive hierarchical aggregation network. Different from the previous approach of modeling semantic interdependencies solely within a single feature map in the channel dimension, we introduce multi-scale channel-wise feature aggregation to capture global contextual dependencies. Meanwhile, we design an adaptive detail-aware module to model long-range spatial dependencies, adaptively extracting effective details from different levels as compensation. Additionally, this module utilizes depth-wise separable convolution to selectively filter out irrelevant spatial details from lower-level feature maps. The outputs of the two modules are fused layer by layer to achieve refined feature representation, thereby generating precise pixel-level segmentation results. Our experiments are conducted on the Vaihingen dataset and the Potsdam dataset. The experimental results demonstrate that the proposed algorithm surpasses several existing state-of-the-art approaches.
KW - deep learning
KW - Remote sensing image
KW - semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=86000027253&partnerID=8YFLogxK
U2 - 10.1109/ICSIDP62679.2024.10869096
DO - 10.1109/ICSIDP62679.2024.10869096
M3 - Conference contribution
AN - SCOPUS:86000027253
T3 - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
BT - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Y2 - 22 November 2024 through 24 November 2024
ER -