TY - GEN
T1 - Multi-scale Change-Aware Transformer for Remote Sensing Image Change Detection
AU - Chen, Huan
AU - Xu, Tingfa
AU - Chen, Zhenxiang
AU - Liu, Peifu
AU - Bai, Huiyan
AU - Li, Jianan
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/10/28
Y1 - 2024/10/28
N2 - Change detection identifies differences between images captured at different times. Real-world change detection faces challenges posed by the diverse and intricate nature of change areas, while current datasets and algorithms are often limited to simpler, consistent changes, reducing their effectiveness in practical applications. Existing dual-branch methods process images independently, risking the loss of change information due to insufficient early interaction. In contrast, single-stream approaches, though improving early integration, lack efficacy in capturing complex changes. To address these limitations, we introduce a novel single-stream framework, the Multi-scale Change-Aware Transformer (MCAT), which features the Dynamic Change-Aware Attention module and the Multi-scale Change-Enhanced Aggregator. The Dynamic Change-Aware Attention module, integrating local self-attention and cross-temporal attention, conducts dynamic iteration on images differences, thereby targeting feature extraction of change areas. The Multi-scale Change-Enhanced Aggregator enables the model to adapt to various scales and complex shapes through local change enhancement and multi-scale aggregation strategies. To overcome the limitations of existing datasets regarding the scale diversity and morphological complexity of change areas, we construct the Mining Area Change Detection dataset. The dataset offers a diverse array of change areas that span multiple scales and exhibit complex shapes, providing a robust benchmark for change detection. Extensive experiments demonstrate that our model outperforms existing methods, especially for irregular and multi-scale changes. Codes and dataset are available at https://github.com/chh11/MCAT.
AB - Change detection identifies differences between images captured at different times. Real-world change detection faces challenges posed by the diverse and intricate nature of change areas, while current datasets and algorithms are often limited to simpler, consistent changes, reducing their effectiveness in practical applications. Existing dual-branch methods process images independently, risking the loss of change information due to insufficient early interaction. In contrast, single-stream approaches, though improving early integration, lack efficacy in capturing complex changes. To address these limitations, we introduce a novel single-stream framework, the Multi-scale Change-Aware Transformer (MCAT), which features the Dynamic Change-Aware Attention module and the Multi-scale Change-Enhanced Aggregator. The Dynamic Change-Aware Attention module, integrating local self-attention and cross-temporal attention, conducts dynamic iteration on images differences, thereby targeting feature extraction of change areas. The Multi-scale Change-Enhanced Aggregator enables the model to adapt to various scales and complex shapes through local change enhancement and multi-scale aggregation strategies. To overcome the limitations of existing datasets regarding the scale diversity and morphological complexity of change areas, we construct the Mining Area Change Detection dataset. The dataset offers a diverse array of change areas that span multiple scales and exhibit complex shapes, providing a robust benchmark for change detection. Extensive experiments demonstrate that our model outperforms existing methods, especially for irregular and multi-scale changes. Codes and dataset are available at https://github.com/chh11/MCAT.
KW - benchmark dataset
KW - change detection
KW - single-stream framework
UR - http://www.scopus.com/inward/record.url?scp=85209798874&partnerID=8YFLogxK
U2 - 10.1145/3664647.3680965
DO - 10.1145/3664647.3680965
M3 - Conference contribution
AN - SCOPUS:85209798874
T3 - MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia
SP - 2992
EP - 3000
BT - MM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 32nd ACM International Conference on Multimedia, MM 2024
Y2 - 28 October 2024 through 1 November 2024
ER -