Doge Tickets: Uncovering Domain-General Language Models by Playing Lottery Tickets

Yi Yang, Chen Zhang, Benyou Wang, Dawei Song*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Peer-reviewed

4 Citations (Scopus)

Abstract

Over-parameterized pre-trained language models (LMs) have shown appealing expressive power due to their small learning bias. However, the huge learning capacity of LMs can also lead to large learning variance. In a pilot study, we find that, when faced with multiple domains, a critical portion of parameters behave unexpectedly in a domain-specific manner while others behave in a domain-general one. Motivated by this phenomenon, we for the first time posit that domain-general parameters can underpin a domain-general LM that can be derived from the original LM. To uncover the domain-general LM, we propose to identify domain-general parameters by playing lottery tickets (dubbed doge tickets). In order to intervene in the lottery, we propose a domain-general score, which depicts how domain-invariant a parameter is by associating it with the variance. Comprehensive experiments are conducted on the Amazon, MNLI, and OntoNotes datasets. The results show that the doge tickets obtain improved out-of-domain generalization in comparison with a range of competitive baselines. Analysis results further hint at the existence of domain-general parameters and the performance consistency of doge tickets.
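
The abstract does not state the formula for the domain-general score, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each parameter already has a per-domain importance value and that the score is the mean importance across domains minus a penalty `lam` times the variance across domains. The function names, the `lam` and `keep_ratio` parameters, and the toy importance values are all hypothetical.

```python
from statistics import fmean, pvariance


def domain_general_scores(importance_by_domain, lam=1.0):
    """Score each parameter by mean importance minus lam * variance.

    importance_by_domain: list of rows, one row per domain, each row holding
    one importance value per parameter (assumed input, not from the paper).
    """
    # Transpose so that each tuple holds one parameter's values across domains.
    per_param = list(zip(*importance_by_domain))
    return [fmean(vals) - lam * pvariance(vals) for vals in per_param]


def select_ticket(scores, keep_ratio):
    """Return a boolean mask keeping the top keep_ratio fraction of scores."""
    k = max(1, round(keep_ratio * len(scores)))
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    keep = set(ranked[:k])
    return [i in keep for i in range(len(scores))]


# Toy example: 3 domains, 4 parameters (values are made up for illustration).
importance = [
    [0.9, 0.8, 0.1, 0.5],
    [0.1, 0.8, 0.1, 0.5],
    [0.9, 0.7, 0.2, 0.5],
]
scores = domain_general_scores(importance, lam=1.0)
mask = select_ticket(scores, keep_ratio=0.5)
print(mask)
```

In this toy setup the first parameter is important in two domains but nearly irrelevant in the third, so the variance penalty pushes it below the fourth parameter, which is moderately and uniformly important. This is the sense in which a variance-aware score could favor domain-invariant parameters over domain-specific ones.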

Original language: English
Title of host publication: Natural Language Processing and Chinese Computing - 11th CCF International Conference, NLPCC 2022, Proceedings
Editors: Wei Lu, Shujian Huang, Yu Hong, Xiabing Zhou
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 144-156
Number of pages: 13
ISBN (Print): 9783031171192
Publication status: Published - 2022
Event: 11th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2022 - Guilin, China
Duration: 24 Sep 2022 to 25 Sep 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13551 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 11th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2022
Country/Territory: China
City: Guilin
Period: 24/09/22 to 25/09/22
