Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery

Shu Wang; Jianhong Han; Ying Wang; Xinyuan Hao; Zhaoyi Luo; Yupei Wang; Liang Chen

doi:10.1109/ICSIDP62679.2024.10868768

Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery

Shu Wang, Jianhong Han, Ying Wang, Xinyuan Hao, Zhaoyi Luo, Yupei Wang^*, Liang Chen

^*Corresponding author for this work

School of Information and Electronics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Unsupervised Domain Adaptation (UDA) techniques are crucial for remote sensing object detection, designed to address performance degradation caused by the domain gap between training and test data. These methods leverage unlabeled target domain data, thus alleviating the high costs associated with data annotation. Recent developments in Detection Transformers (DETR) have simplified the detection pipeline and attracted significant research interest. Building on this architecture, we introduce an unsupervised domain adaptation detector for remote sensing object detection. Specifically, we introduce a multi-view adaptive feature alignment module that initially captures domain-specific features in complex backgrounds by leveraging a cross-attention mechanism. Subsequently, we employ contrastive learning to enforce the aggregation of domain-specific features from various perspectives, thereby improving the accuracy of feature alignment. Moreover, we demonstrate that integrating the self-training framework into DETR-based detectors can significantly mitigate the domain gap by further utilizing unlabeled data in the target domain. We validated the effectiveness and generalizability of our method across two remote sensing cross-domain detection scenarios using four public datasets.

Original language	English
Title of host publication	IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9798331515669
DOIs	https://doi.org/10.1109/ICSIDP62679.2024.10868768
Publication status	Published - 2024
Event	2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 - Zhuhai, China Duration: 22 Nov 2024 → 24 Nov 2024

Publication series

Name	IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

Conference

Conference	2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024
Country/Territory	China
City	Zhuhai
Period	22/11/24 → 24/11/24

Keywords

object detection
remote sensing imagery
Unsupervised domain adaptation

Access to Document

10.1109/ICSIDP62679.2024.10868768

Cite this

Wang, S., Han, J., Wang, Y., Hao, X., Luo, Z., Wang, Y., & Chen, L. (2024). Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery. In IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 (IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICSIDP62679.2024.10868768

Wang, Shu ; Han, Jianhong ; Wang, Ying et al. / Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery. IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024. Institute of Electrical and Electronics Engineers Inc., 2024. (IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024).

@inproceedings{59bf962a62d140649b166db6de148e8b,

title = "Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery",

abstract = "Unsupervised Domain Adaptation (UDA) techniques are crucial for remote sensing object detection, designed to address performance degradation caused by the domain gap between training and test data. These methods leverage unlabeled target domain data, thus alleviating the high costs associated with data annotation. Recent developments in Detection Transformers (DETR) have simplified the detection pipeline and attracted significant research interest. Building on this architecture, we introduce an unsupervised domain adaptation detector for remote sensing object detection. Specifically, we introduce a multi-view adaptive feature alignment module that initially captures domain-specific features in complex backgrounds by leveraging a cross-attention mechanism. Subsequently, we employ contrastive learning to enforce the aggregation of domain-specific features from various perspectives, thereby improving the accuracy of feature alignment. Moreover, we demonstrate that integrating the self-training framework into DETR-based detectors can significantly mitigate the domain gap by further utilizing unlabeled data in the target domain. We validated the effectiveness and generalizability of our method across two remote sensing cross-domain detection scenarios using four public datasets.",

keywords = "object detection, remote sensing imagery, Unsupervised domain adaptation",

author = "Shu Wang and Jianhong Han and Ying Wang and Xinyuan Hao and Zhaoyi Luo and Yupei Wang and Liang Chen",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024 ; Conference date: 22-11-2024 Through 24-11-2024",

year = "2024",

doi = "10.1109/ICSIDP62679.2024.10868768",

language = "English",

series = "IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024",

address = "United States",

}

Wang, S, Han, J, Wang, Y, Hao, X, Luo, Z, Wang, Y & Chen, L 2024, Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery. in IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024. IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024, Institute of Electrical and Electronics Engineers Inc., 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024, Zhuhai, China, 22/11/24. https://doi.org/10.1109/ICSIDP62679.2024.10868768

Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery. / Wang, Shu; Han, Jianhong; Wang, Ying et al.
IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024. Institute of Electrical and Electronics Engineers Inc., 2024. (IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery

AU - Wang, Shu

AU - Han, Jianhong

AU - Wang, Ying

AU - Hao, Xinyuan

AU - Luo, Zhaoyi

AU - Wang, Yupei

AU - Chen, Liang

PY - 2024

Y1 - 2024

N2 - Unsupervised Domain Adaptation (UDA) techniques are crucial for remote sensing object detection, designed to address performance degradation caused by the domain gap between training and test data. These methods leverage unlabeled target domain data, thus alleviating the high costs associated with data annotation. Recent developments in Detection Transformers (DETR) have simplified the detection pipeline and attracted significant research interest. Building on this architecture, we introduce an unsupervised domain adaptation detector for remote sensing object detection. Specifically, we introduce a multi-view adaptive feature alignment module that initially captures domain-specific features in complex backgrounds by leveraging a cross-attention mechanism. Subsequently, we employ contrastive learning to enforce the aggregation of domain-specific features from various perspectives, thereby improving the accuracy of feature alignment. Moreover, we demonstrate that integrating the self-training framework into DETR-based detectors can significantly mitigate the domain gap by further utilizing unlabeled data in the target domain. We validated the effectiveness and generalizability of our method across two remote sensing cross-domain detection scenarios using four public datasets.

AB - Unsupervised Domain Adaptation (UDA) techniques are crucial for remote sensing object detection, designed to address performance degradation caused by the domain gap between training and test data. These methods leverage unlabeled target domain data, thus alleviating the high costs associated with data annotation. Recent developments in Detection Transformers (DETR) have simplified the detection pipeline and attracted significant research interest. Building on this architecture, we introduce an unsupervised domain adaptation detector for remote sensing object detection. Specifically, we introduce a multi-view adaptive feature alignment module that initially captures domain-specific features in complex backgrounds by leveraging a cross-attention mechanism. Subsequently, we employ contrastive learning to enforce the aggregation of domain-specific features from various perspectives, thereby improving the accuracy of feature alignment. Moreover, we demonstrate that integrating the self-training framework into DETR-based detectors can significantly mitigate the domain gap by further utilizing unlabeled data in the target domain. We validated the effectiveness and generalizability of our method across two remote sensing cross-domain detection scenarios using four public datasets.

KW - object detection

KW - remote sensing imagery

KW - Unsupervised domain adaptation

UR - http://www.scopus.com/inward/record.url?scp=86000024698&partnerID=8YFLogxK

U2 - 10.1109/ICSIDP62679.2024.10868768

DO - 10.1109/ICSIDP62679.2024.10868768

M3 - Conference contribution

AN - SCOPUS:86000024698

T3 - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

BT - IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2nd IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024

Y2 - 22 November 2024 through 24 November 2024

ER -

Wang S, Han J, Wang Y, Hao X, Luo Z, Wang Y et al. Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery. In IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024. Institute of Electrical and Electronics Engineers Inc. 2024. (IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2024). doi: 10.1109/ICSIDP62679.2024.10868768

Cross-Domain Detection Transformer with Multi-view Adaptive Feature Alignment in Remote Sensing Imagery

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this