Routing Based Context Selection for Document-Level Neural Machine Translation

Weilun Fei, Ping Jian*, Xiaoguang Zhu, Yi Lin

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Most of the existing methods of document-level neural machine translation (NMT) integrate more textual information by extending the scope of sentence encoding. Usually, the sentence-level representation is incorporated (via attention or gate mechanism) in these methods, which makes them straightforward but rough, and it is difficult to distinguish useful contextual information from noises. Furthermore, the longer the encoding length is, the more difficult it is for the model to grasp the inter-dependency between sentences. In this paper, a document-level NMT method based on a routing algorithm is presented, which can automatically select context information. The routing mechanism endows the current source sentence with the ability to decide which words can become its context. This leads the method to merge the inter-sentence dependencies in a more flexible and elegant way, and model local structure information more effectively. At the same time, this structured information selection mechanism will also alleviate the possible problems caused by long-distance encoding. Experimental results show that our method is 2.91 BLEU higher than the Transformer model on the public dataset of ZH-EN, and is superior to most of the state-of-the-art document-level NMT models.

源语言英语
主期刊名Machine Translation - 17th China Conference, CCMT 2021, Revised Selected Papers
编辑Jinsong Su, Rico Sennrich
出版商Springer Science and Business Media Deutschland GmbH
77-91
页数15
ISBN(印刷版)9789811675119
DOI
出版状态已出版 - 2021
活动17th China Conference on Machine Translation, CCMT 2021 - Xining, 中国
期限: 8 10月 202110 10月 2021

出版系列

姓名Communications in Computer and Information Science
1464 CCIS
ISSN(印刷版)1865-0929
ISSN(电子版)1865-0937

会议

会议17th China Conference on Machine Translation, CCMT 2021
国家/地区中国
Xining
时期8/10/2110/10/21

指纹

探究 'Routing Based Context Selection for Document-Level Neural Machine Translation' 的科研主题。它们共同构成独一无二的指纹。

引用此