TY - JOUR
T1 - Transferable attention networks for adversarial domain adaptation
AU - Zhang, Changchun
AU - Zhao, Qingjie
AU - Wang, Yu
N1 - Publisher Copyright:
© 2020 Elsevier Inc.
PY - 2020/10
Y1 - 2020/10
N2 - Domain adaptation is one of the fundamental challenges in transfer learning. Effectively transferring knowledge from a labeled source domain to an unlabeled target domain is critical for domain adaptation, as it helps reduce the considerable performance gap caused by domain shift. Existing domain adaptation methods address this issue by matching global features across domains. However, not all features are transferable, and forcibly matching untransferable features may lead to negative transfer. In this paper, we propose a novel method dubbed transferable attention networks (TAN) to address this issue. The proposed TAN focuses on feature alignment via adversarial optimization. Specifically, we utilize the self-attention mechanism to weight the extracted features, such that the influence of untransferable features can be effectively eliminated. Meanwhile, to exploit the complex multi-modal structures of domain adaptation, we use the learned features and classifier predictions as the condition for training the adversarial networks. Furthermore, we propose that accurately transferable features should minimize the domain discrepancy. Three loss functions are introduced into the adversarial networks: classification loss, attention transfer loss, and condition transfer loss. Extensive experiments on the Office-31, ImageCLEF-DA, Office-Home, and VisDA-2017 datasets demonstrate that the proposed approach yields state-of-the-art results.
AB - Domain adaptation is one of the fundamental challenges in transfer learning. Effectively transferring knowledge from a labeled source domain to an unlabeled target domain is critical for domain adaptation, as it helps reduce the considerable performance gap caused by domain shift. Existing domain adaptation methods address this issue by matching global features across domains. However, not all features are transferable, and forcibly matching untransferable features may lead to negative transfer. In this paper, we propose a novel method dubbed transferable attention networks (TAN) to address this issue. The proposed TAN focuses on feature alignment via adversarial optimization. Specifically, we utilize the self-attention mechanism to weight the extracted features, such that the influence of untransferable features can be effectively eliminated. Meanwhile, to exploit the complex multi-modal structures of domain adaptation, we use the learned features and classifier predictions as the condition for training the adversarial networks. Furthermore, we propose that accurately transferable features should minimize the domain discrepancy. Three loss functions are introduced into the adversarial networks: classification loss, attention transfer loss, and condition transfer loss. Extensive experiments on the Office-31, ImageCLEF-DA, Office-Home, and VisDA-2017 datasets demonstrate that the proposed approach yields state-of-the-art results.
KW - Adversarial networks
KW - Domain adaptation
KW - Self-attention mechanism
UR - http://www.scopus.com/inward/record.url?scp=85086827503&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2020.06.016
DO - 10.1016/j.ins.2020.06.016
M3 - Article
AN - SCOPUS:85086827503
SN - 0020-0255
VL - 539
SP - 422
EP - 433
JO - Information Sciences
JF - Information Sciences
ER -