TY - GEN
T1 - Towards Faithful Dialogs via Focus Learning
AU - Deng, Yifan
AU - Zhang, Xingsheng
AU - Huang, Heyan
AU - Hu, Yue
N1 - Publisher Copyright:
© 2023 Association for Computational Linguistics.
PY - 2023
Y1 - 2023
N2 - Maintaining faithfulness between responses and knowledge is an important research topic for building reliable knowledge-grounded dialogue systems. Existing models rely heavily on elaborate data engineering and larger model parameters, while neglecting to track the tokens that most influence the loss, which is decisive for the model's optimization direction in each iteration. To address this issue, we propose Focus Learning (FocusL), a novel learning approach that adjusts the contribution of each token to the optimization direction by directly scaling its objective loss. Specifically, we first introduce a positioning method that uses relevance distributions between the knowledge and each response token to locate knowledge-aware tokens. We then design a relevance-to-weight transformation that provides dynamic token-level weights for adjusting the cross-entropy loss. Finally, we use the weighted loss to encourage the model to pay special attention to knowledge utilization. Experimental results demonstrate that our method achieves new state-of-the-art results and generates more reliable responses while maintaining training stability.
AB - Maintaining faithfulness between responses and knowledge is an important research topic for building reliable knowledge-grounded dialogue systems. Existing models rely heavily on elaborate data engineering and larger model parameters, while neglecting to track the tokens that most influence the loss, which is decisive for the model's optimization direction in each iteration. To address this issue, we propose Focus Learning (FocusL), a novel learning approach that adjusts the contribution of each token to the optimization direction by directly scaling its objective loss. Specifically, we first introduce a positioning method that uses relevance distributions between the knowledge and each response token to locate knowledge-aware tokens. We then design a relevance-to-weight transformation that provides dynamic token-level weights for adjusting the cross-entropy loss. Finally, we use the weighted loss to encourage the model to pay special attention to knowledge utilization. Experimental results demonstrate that our method achieves new state-of-the-art results and generates more reliable responses while maintaining training stability.
UR - http://www.scopus.com/inward/record.url?scp=85174388434&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85174388434
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 4554
EP - 4566
BT - Long Papers
PB - Association for Computational Linguistics (ACL)
T2 - 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Y2 - 9 July 2023 through 14 July 2023
ER -