Bilevel Scheduled Sampling for Dialogue Generation

Jiawen Liu, Kan Li*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Exposure bias poses a common challenge in numerous natural language processing tasks, particularly in the dialog generation. In response to this issue, researchers have devised various techniques, among which scheduled sampling has proven to be an effective method for mitigating exposure bias. However, the existing state-of-the-art scheduled sampling methods solely consider the current sampling words’ quality for threshold truncation sampling, which overlooks the importance of sentence-level information and the method of threshold truncation warrants further discussion. In this paper, we propose a bilevel scheduled sampling model that takes the sentence-level information into account and incorporates it with word-level quality. To enhance sampling diversity and improve the model’s adaptability, we propose a smooth function that maps the combined result of sentence-level and word-level information to an appropriate range, and employ probabilistic sampling based on the mapped values instead of threshold truncation. Experiments conducted on the DailyDialog and PersonaChat datasets demonstrate the effectiveness of our proposed methods, which significantly alleviate the exposure bias problem and outperform state-of-the-art scheduled sampling methods.

源语言英语
主期刊名Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings
编辑Fei Liu, Nan Duan, Qingting Xu, Yu Hong
出版商Springer Science and Business Media Deutschland GmbH
827-839
页数13
ISBN(印刷版)9783031446924
DOI
出版状态已出版 - 2023
活动12th National CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2023 - Foshan, 中国
期限: 12 10月 202315 10月 2023

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14302 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议12th National CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2023
国家/地区中国
Foshan
时期12/10/2315/10/23

指纹

探究 'Bilevel Scheduled Sampling for Dialogue Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此

Liu, J., & Li, K. (2023). Bilevel Scheduled Sampling for Dialogue Generation. 在 F. Liu, N. Duan, Q. Xu, & Y. Hong (编辑), Natural Language Processing and Chinese Computing - 12th National CCF Conference, NLPCC 2023, Proceedings (页码 827-839). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 14302 LNAI). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-44693-1_64