跳到主要导航 跳到搜索 跳到主要内容

Can Cross-Lingual Transferability of Multilingual Transformers Be Activated Without End-Task Data?

  • Beijing Institute of Technology
  • Beijing Engineering Research Center of High Volume Language

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Pretrained multilingual Transformers have achieved great success in cross-lingual transfer learning. Current methods typically activate the cross-lingual transferability of multilingual Transformers by fine-tuning them on end-task data. However, the methods cannot perform cross-lingual transfer when end-task data are unavailable. In this work, we explore whether the cross-lingual transferability can be activated without end-task data. We propose a cross-lingual transfer method, named PLUGIN-X. PLUGIN-X disassembles monolingual and multilingual Transformers into sub-modules, and reassembles them to be the multilingual end-task model. After representation adaptation, PLUGIN-X finally performs cross-lingual transfer in a plug-and-play style. Experimental results show that PLUGIN-X successfully activates the cross-lingual transferability of multilingual Transformers without accessing end-task data. Moreover, we analyze how the cross-model representation alignment affects the cross-lingual transferability.

源语言英语
主期刊名Findings of the Association for Computational Linguistics, ACL 2023
出版商Association for Computational Linguistics (ACL)
12572-12584
页数13
ISBN(电子版)9781959429623
DOI
出版状态已出版 - 2023
已对外发布
活动Findings of the Association for Computational Linguistics, ACL 2023 - Toronto, 加拿大
期限: 9 7月 202314 7月 2023

出版系列

姓名Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN(印刷版)0736-587X

会议

会议Findings of the Association for Computational Linguistics, ACL 2023
国家/地区加拿大
Toronto
时期9/07/2314/07/23

指纹

探究 'Can Cross-Lingual Transferability of Multilingual Transformers Be Activated Without End-Task Data?' 的科研主题。它们共同构成独一无二的指纹。

引用此