Causality Inspired Representation Learning for Domain Generalization

Fangrui Lv; Jian Liang; Shuang Li; Bin Zang; Chi Harold Liu; Ziteng Wang; Di Liu

doi:10.1109/CVPR52688.2022.00788

Causality Inspired Representation Learning for Domain Generalization

Fangrui Lv, Jian Liang, Shuang Li^*, Bin Zang, Chi Harold Liu, Ziteng Wang, Di Liu

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

113 引用（Scopus）

摘要

Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 11Code is available at 'https://github.com/BIT-DA/CIRL'.

源语言	英语
主期刊名	Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
出版商	IEEE Computer Society
页	8036-8046
页数	11
ISBN（电子版）	9781665469463
DOI	https://doi.org/10.1109/CVPR52688.2022.00788
出版状态	已出版 - 2022
活动	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, 美国期限: 19 6月 2022 → 24 6月 2022

出版系列

姓名	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
卷	2022-June
ISSN（印刷版）	1063-6919

会议

会议	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
国家/地区	美国
市	New Orleans
时期	19/06/22 → 24/06/22

访问文件

10.1109/CVPR52688.2022.00788

其它文件与链接

链接到 Scopus 的出版物

引用此

Lv, F., Liang, J., Li, S., Zang, B., Liu, C. H., Wang, Z., & Liu, D. (2022). Causality Inspired Representation Learning for Domain Generalization. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 (页码 8036-8046). (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2022-June). IEEE Computer Society. https://doi.org/10.1109/CVPR52688.2022.00788

@inproceedings{1edb3cbd8e9d45c88813e0e1d3d6b284,

title = "Causality Inspired Representation Learning for Domain Generalization",

abstract = "Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 11Code is available at 'https://github.com/BIT-DA/CIRL'.",

keywords = "Machine learning, Recognition: detection, Representation learning, Self- & semi- & meta- & unsupervised learning, Transfer/low-shot/long-tail learning, categorization, retrieval",

author = "Fangrui Lv and Jian Liang and Shuang Li and Bin Zang and Liu, {Chi Harold} and Ziteng Wang and Di Liu",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 ; Conference date: 19-06-2022 Through 24-06-2022",

year = "2022",

doi = "10.1109/CVPR52688.2022.00788",

language = "English",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "8036--8046",

booktitle = "Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022",

address = "United States",

}

Lv, F, Liang, J, Li, S, Zang, B, Liu, CH, Wang, Z & Liu, D 2022, Causality Inspired Representation Learning for Domain Generalization. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 卷 2022-June, IEEE Computer Society, 页码 8036-8046, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, 美国, 19/06/22. https://doi.org/10.1109/CVPR52688.2022.00788

Causality Inspired Representation Learning for Domain Generalization. / Lv, Fangrui; Liang, Jian; Li, Shuang 等.
Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society, 2022. 页码 8036-8046 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; 卷 2022-June).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Causality Inspired Representation Learning for Domain Generalization

AU - Lv, Fangrui

AU - Liang, Jian

AU - Li, Shuang

AU - Zang, Bin

AU - Liu, Chi Harold

AU - Wang, Ziteng

AU - Liu, Di

PY - 2022

Y1 - 2022

N2 - Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 11Code is available at 'https://github.com/BIT-DA/CIRL'.

AB - Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 11Code is available at 'https://github.com/BIT-DA/CIRL'.

KW - Machine learning

KW - Recognition: detection

KW - Representation learning

KW - Self- & semi- & meta- & unsupervised learning

KW - Transfer/low-shot/long-tail learning

KW - categorization

KW - retrieval

UR - http://www.scopus.com/inward/record.url?scp=85134219924&partnerID=8YFLogxK

U2 - 10.1109/CVPR52688.2022.00788

DO - 10.1109/CVPR52688.2022.00788

M3 - Conference contribution

AN - SCOPUS:85134219924

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 8036

EP - 8046

BT - Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

PB - IEEE Computer Society

T2 - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022

Y2 - 19 June 2022 through 24 June 2022

ER -

Lv F, Liang J, Li S, Zang B, Liu CH, Wang Z 等. Causality Inspired Representation Learning for Domain Generalization. 在 Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022. IEEE Computer Society. 2022. 页码 8036-8046. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR52688.2022.00788

Causality Inspired Representation Learning for Domain Generalization

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此