TY - GEN
T1 - Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery
AU - Wang, Shu
AU - Han, Jianhong
AU - Wang, Ying
AU - Hao, Xinyuan
AU - Luo, Zhaoyi
AU - Wang, Yupei
AU - Chen, Liang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Unsupervised Domain Adaptation (UDA) techniques are crucial for remote sensing object detection, designed to address performance degradation caused by the domain gap between training and test data. These methods leverage unlabeled target domain data, thus alleviating the high costs associated with data annotation. Recent developments in Detection Transformers (DETR) have simplified the detection pipeline and attracted significant research interest. Building on this architecture, we introduce an unsupervised domain adaptation detector for remote sensing object detection. Specifically, we introduce a multi-view adaptive feature alignment module that initially captures domain-specific features in complex backgrounds by leveraging a cross-attention mechanism. Subsequently, we employ contrastive learning to enforce the aggregation of domain-specific features from various perspectives, thereby improving the accuracy of feature alignment. Moreover, we demonstrate that integrating the self-training framework into DETR-based detectors can significantly mitigate the domain gap by further utilizing unlabeled data in the target domain. We validated the effectiveness and generalizability of our method across two remote sensing cross-domain detection scenarios using four public datasets.
AB - Unsupervised Domain Adaptation (UDA) techniques are crucial for remote sensing object detection, designed to address performance degradation caused by the domain gap between training and test data. These methods leverage unlabeled target domain data, thus alleviating the high costs associated with data annotation. Recent developments in Detection Transformers (DETR) have simplified the detection pipeline and attracted significant research interest. Building on this architecture, we introduce an unsupervised domain adaptation detector for remote sensing object detection. Specifically, we introduce a multi-view adaptive feature alignment module that initially captures domain-specific features in complex backgrounds by leveraging a cross-attention mechanism. Subsequently, we employ contrastive learning to enforce the aggregation of domain-specific features from various perspectives, thereby improving the accuracy of feature alignment. Moreover, we demonstrate that integrating the self-training framework into DETR-based detectors can significantly mitigate the domain gap by further utilizing unlabeled data in the target domain. We validated the effectiveness and generalizability of our method across two remote sensing cross-domain detection scenarios using four public datasets.
KW - object detection
KW - remote sensing imagery
KW - Unsupervised domain adaptation
UR - http://www.scopus.com/inward/record.url?scp=86000024698&partnerID=8YFLogxK
U2 - 10.1109/ICSIDP62679.2024.10868768
DO - 10.1109/ICSIDP62679.2024.10868768
M3 - Conference contribution
AN - SCOPUS:86000024698
T3 - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
BT - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Y2 - 22 November 2024 through 24 November 2024
ER -