Routing Based Context Selection for Document-Level Neural Machine Translation

Weilun Fei, Ping Jian*, Xiaoguang Zhu, Yi Lin

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most of the existing methods of document-level neural machine translation (NMT) integrate more textual information by extending the scope of sentence encoding. Usually, the sentence-level representation is incorporated (via attention or gate mechanism) in these methods, which makes them straightforward but rough, and it is difficult to distinguish useful contextual information from noises. Furthermore, the longer the encoding length is, the more difficult it is for the model to grasp the inter-dependency between sentences. In this paper, a document-level NMT method based on a routing algorithm is presented, which can automatically select context information. The routing mechanism endows the current source sentence with the ability to decide which words can become its context. This leads the method to merge the inter-sentence dependencies in a more flexible and elegant way, and model local structure information more effectively. At the same time, this structured information selection mechanism will also alleviate the possible problems caused by long-distance encoding. Experimental results show that our method is 2.91 BLEU higher than the Transformer model on the public dataset of ZH-EN, and is superior to most of the state-of-the-art document-level NMT models.

Original languageEnglish
Title of host publicationMachine Translation - 17th China Conference, CCMT 2021, Revised Selected Papers
EditorsJinsong Su, Rico Sennrich
PublisherSpringer Science and Business Media Deutschland GmbH
Pages77-91
Number of pages15
ISBN (Print)9789811675119
DOIs
Publication statusPublished - 2021
Event17th China Conference on Machine Translation, CCMT 2021 - Xining, China
Duration: 8 Oct 202110 Oct 2021

Publication series

NameCommunications in Computer and Information Science
Volume1464 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference17th China Conference on Machine Translation, CCMT 2021
Country/TerritoryChina
CityXining
Period8/10/2110/10/21

Keywords

  • Document-Level Neural Machine Translation
  • Natural Language Processing
  • Routing Algorithm

Fingerprint

Dive into the research topics of 'Routing Based Context Selection for Document-Level Neural Machine Translation'. Together they form a unique fingerprint.

Cite this