PEK: A Parameter-Efficient Framework for Knowledge-Grounded Dialogue Generation

Pan Yang, Dandan Song*, Zhijing Wu, Yanru Zhou, Ziyi Yang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Pre-trained language models (PLMs) have shown great dialogue generation capability in different scenarios. However, the huge VRAM consumption when fine-tuning them is one of their drawbacks. PEFT approaches can significantly reduce the number of trainable parameters, which enables us to fine-tune larger dialogue generation models. However, the reduction in parameter quantity can diminish a PLM's expressive capacity and affect the PLM's learning from certain specific examples like knowledge-related conversations. Previous works have demonstrated that injecting external knowledge into dialogue generation models can improve the model's performance in knowledge-related conversations. Nonetheless, these methods are designed for the scenario where most parameters of the entire framework are trainable. In this paper, we propose PEK, a parameter-efficient framework for knowledge-enhanced dialogue generation. It enables PLMs to leverage external knowledge documents and knowledge graphs to enhance its generation capabilities with an acceptable number of trainable parameters. Evaluation results on the Wizard of Wikipedia and CMU_DoG datasets show that our approach outperforms baseline methods on multiple evaluation metrics, which validates the effectiveness of our approach.

源语言英语
主期刊名62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference
编辑Lun-Wei Ku, Andre Martins, Vivek Srikumar
出版商Association for Computational Linguistics (ACL)
9261-9273
页数13
ISBN(电子版)9798891760998
DOI
出版状态已出版 - 2024
活动Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Hybrid, Bangkok, 泰国
期限: 11 8月 202416 8月 2024

出版系列

姓名Proceedings of the Annual Meeting of the Association for Computational Linguistics
ISSN(印刷版)0736-587X

会议

会议Findings of the 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024
国家/地区泰国
Hybrid, Bangkok
时期11/08/2416/08/24

指纹

探究 'PEK: A Parameter-Efficient Framework for Knowledge-Grounded Dialogue Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此

Yang, P., Song, D., Wu, Z., Zhou, Y., & Yang, Z. (2024). PEK: A Parameter-Efficient Framework for Knowledge-Grounded Dialogue Generation. 在 L.-W. Ku, A. Martins, & V. Srikumar (编辑), 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Proceedings of the Conference (页码 9261-9273). (Proceedings of the Annual Meeting of the Association for Computational Linguistics). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2024.findings-acl.550