Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method

Xuewen Shi, Heyan Huang, Ping Jian*, Yi Kun Tang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Neural machine translation (NMT) usually employs beam search to expand the searching space and obtain more translation candidates. However, the increase of the beam size often suffers from plenty of short translations, resulting in dramatical decrease in translation quality. In this paper, we handle the length bias problem through a perspective of causal inference. Specifically, we regard the model generated translation score S as a degraded true translation quality affected by some noise, and one of the confounders is the translation length. We apply a Half-Sibling Regression method to remove the length effect on S, and then we can obtain a debiased translation score without length information. The proposed method is model agnostic and unsupervised, which is adaptive to any NMT model and test dataset. We conduct the experiments on three translation tasks with different scales of datasets. Experimental results and further analyses show that our approaches gain comparable performance with the empirical baseline methods.

Original languageEnglish
Title of host publicationChinese Computational Linguistics - 20th China National Conference, CCL 2021, Proceedings
EditorsSheng Li, Maosong Sun, Yang Liu, Hua Wu, Liu Kang, Wanxiang Che, Shizhu He, Gaoqi Rao
PublisherSpringer Science and Business Media Deutschland GmbH
Pages3-15
Number of pages13
ISBN (Print)9783030841850
DOIs
Publication statusPublished - 2021
Event20th China National Conference on Computational Linguistics, CCL 2021 - Virtual, Online
Duration: 13 Aug 202115 Aug 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12869 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th China National Conference on Computational Linguistics, CCL 2021
CityVirtual, Online
Period13/08/2115/08/21

Keywords

  • Causal inference
  • Half-sibling regression
  • Machine translation

Fingerprint

Dive into the research topics of 'Reducing Length Bias in Scoring Neural Machine Translation via a Causal Inference Method'. Together they form a unique fingerprint.

Cite this