TY - CONF
T1 - Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method
AU - Shi, Xuewen
AU - Huang, Heyan
AU - Jian, Ping
AU - Tang, Yi Kun
N1 - Publisher Copyright:
© 2021 China National Conference on Computational Linguistics. Published under Creative Commons Attribution 4.0 International License.
PY - 2021
Y1 - 2021
N2 - Neural machine translation (NMT) usually employs beam search to expand the search space and obtain more translation candidates. However, increasing the beam size often yields many short translations, resulting in a dramatic decrease in translation quality. In this paper, we handle the length bias problem from the perspective of causal inference. Specifically, we regard the model-generated translation score S as the true translation quality degraded by some noise, one confounder of which is the translation length. We apply a Half-Sibling Regression method to remove the length effect on S, thereby obtaining a debiased translation score free of length information. The proposed method is model-agnostic and unsupervised, making it applicable to any NMT model and test dataset. We conduct experiments on three translation tasks with datasets of different scales. Experimental results and further analyses show that our approaches achieve performance comparable to the empirical baseline methods.
AB - Neural machine translation (NMT) usually employs beam search to expand the search space and obtain more translation candidates. However, increasing the beam size often yields many short translations, resulting in a dramatic decrease in translation quality. In this paper, we handle the length bias problem from the perspective of causal inference. Specifically, we regard the model-generated translation score S as the true translation quality degraded by some noise, one confounder of which is the translation length. We apply a Half-Sibling Regression method to remove the length effect on S, thereby obtaining a debiased translation score free of length information. The proposed method is model-agnostic and unsupervised, making it applicable to any NMT model and test dataset. We conduct experiments on three translation tasks with datasets of different scales. Experimental results and further analyses show that our approaches achieve performance comparable to the empirical baseline methods.
UR - http://www.scopus.com/inward/record.url?scp=85123422485&partnerID=8YFLogxK
M3 - Paper
AN - SCOPUS:85123422485
SP - 874
EP - 885
T2 - 20th Chinese National Conference on Computational Linguistics, CCL 2021
Y2 - 13 August 2021 through 15 August 2021
ER -