Hierarchical Inductive Transfer for Continual Dialogue Learning

Shaoxiong Feng, Xuancheng Ren, Kan Li*, Xu Sun*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Pre-trained models have achieved excellent performance on the dialogue task. However, for the continual increase of online chit-chat scenarios, directly fine-tuning these models for each of the new tasks not only explodes the capacity of the dialogue system on the embedded devices but also causes knowledge forgetting on pre-trained models and knowledge interference among diverse dialogue tasks. In this work, we propose a hierarchical inductive transfer framework to learn and deploy the dialogue skills continually and efficiently. First, we introduce the adapter module into pre-trained models for learning new dialogue tasks. As the only trainable module, it is beneficial for the dialogue system on the embedded devices to acquire new dialogue skills with negligible additional parameters. Then, for alleviating knowledge interference between tasks yet benefiting the regularization between them, we further design hierarchical inductive transfer that enables new tasks to use general knowledge in the base adapter without being misled by diverse knowledge in task-specific adapters. Empirical evaluation and analysis indicate that our framework obtains comparable performance under deployment-friendly model capacity.

源语言英语
主期刊名ACL 2022 - 60th Annual Meeting of the Association for Computational Linguistics, Findings of ACL 2022
编辑Smaranda Muresan, Preslav Nakov, Aline Villavicencio
出版商Association for Computational Linguistics (ACL)
693-699
页数7
ISBN(电子版)9781955917254
出版状态已出版 - 2022
活动60th Annual Meeting of the Association for Computational Linguistics, ACL 2022 - Dublin, 爱尔兰
期限: 22 5月 202227 5月 2022

出版系列

姓名Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN(印刷版)0736-587X

会议

会议60th Annual Meeting of the Association for Computational Linguistics, ACL 2022
国家/地区爱尔兰
Dublin
时期22/05/2227/05/22

指纹

探究 'Hierarchical Inductive Transfer for Continual Dialogue Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此