Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method

Xuewen Shi, Heyan Huang, Ping Jian*, Yi Kun Tang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Neural machine translation (NMT) usually employs beam search to expand the searching space and obtain more translation candidates. However, the increase of the beam size often suffers from plenty of short translations, resulting in dramatical decrease in translation quality. In this paper, we handle the length bias problem through a perspective of causal inference. Specifically, we regard the model generated translation score S as a degraded true translation quality affected by some noise, and one of the confounders is the translation length. We apply a Half-Sibling Regression method to remove the length effect on S, and then we can obtain a debiased translation score without length information. The proposed method is model agnostic and unsupervised, which is adaptive to any NMT model and test dataset. We conduct the experiments on three translation tasks with different scales of datasets. Experimental results and further analyses show that our approaches gain comparable performance with the empirical baseline methods.

源语言英语
主期刊名Chinese Computational Linguistics - 20th China National Conference, CCL 2021, Proceedings
编辑Sheng Li, Maosong Sun, Yang Liu, Hua Wu, Liu Kang, Wanxiang Che, Shizhu He, Gaoqi Rao
出版商Springer Science and Business Media Deutschland GmbH
3-15
页数13
ISBN(印刷版)9783030841850
DOI
出版状态已出版 - 2021
活动20th China National Conference on Computational Linguistics, CCL 2021 - Virtual, Online
期限: 13 8月 202115 8月 2021

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12869 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议20th China National Conference on Computational Linguistics, CCL 2021
Virtual, Online
时期13/08/2115/08/21

指纹

探究 'Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method' 的科研主题。它们共同构成独一无二的指纹。

引用此