Removing Input Confounder for Translation Quality Estimation via a Causal Motivated Method

Xuewen Shi, Heyan Huang, Ping Jian*, Yi Kun Tang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Most state-of-the-art QE systems built upon neural networks have achieved promising performances on benchmark datasets. However, the performance of these methods can be easily influenced by the inherent features of the model input, such as the length of input sequence or the number of unseen tokens. In this paper, we introduce a causal inference based method to eliminate the negative impact caused by the characters of the input for a QE system. Specifically, we propose an iterative denoising framework for multiple confounding features. The confounder elimination operation at each iteration step is implemented by a Half-Sibling Regression based method. We conduct our experiments on the official datasets and submissions from WMT 2020 Quality Estimation Shared Task of Sentence-Level Direct Assessment. Experimental results show that the denoised QE results gain better Pearson’s correlation scores with human assessments compared to the original submissions.

源语言英语
主期刊名Web and Big Data - 5th International Joint Conference, APWeb-WAIM 2021, Proceedings
编辑Leong Hou U, Marc Spaniol, Yasushi Sakurai, Junying Chen
出版商Springer Science and Business Media Deutschland GmbH
358-364
页数7
ISBN(印刷版)9783030858957
DOI
出版状态已出版 - 2021
活动5th International Joint Conference on Asia-Pacific Web and Web-Age Information Management, APWeb-WAIM 2021 - Guangzhou, 中国
期限: 23 8月 202125 8月 2021

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12858 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议5th International Joint Conference on Asia-Pacific Web and Web-Age Information Management, APWeb-WAIM 2021
国家/地区中国
Guangzhou
时期23/08/2125/08/21

指纹

探究 'Removing Input Confounder for Translation Quality Estimation via a Causal Motivated Method' 的科研主题。它们共同构成独一无二的指纹。

引用此