Improving Neural Machine Translation by Transferring Knowledge from Syntactic Constituent Alignment Learning

Chao Su, Heyan Huang, Shumin Shi*, Ping Jian

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Statistical machine translation (SMT) models rely on word-, phrase-, and syntax-level alignments. But neural machine translation (NMT) models rarely explicitly learn the phrase- and syntax-level alignments. In this article, we propose to improve NMT by explicitly learning the bilingual syntactic constituent alignments. Specifically, we first utilize syntactic parsers to induce syntactic structures of sentences, and then we propose two ways to utilize the syntactic constituents in a perceptual (not adversarial) generator-discriminator training framework. One way is to use them to measure the alignment score of sentence-level training examples, and the other is to directly score the alignments of constituent-level examples generated with an algorithm based on word-level alignments from SMT. In our generator-discriminator framework, the discriminator is pre-trained to learn constituent alignments and distinguish the ground-truth translation from the fake ones, while the generative translation model is fine-tuned to receive the alignment knowledge and to generate translations that best approximate the true ones. Experiments and analysis show that the learned constituent alignments can help improve the translation results.

Original languageEnglish
Article number91
JournalACM Transactions on Asian and Low-Resource Language Information Processing
Volume21
Issue number5
DOIs
Publication statusPublished - 29 Apr 2022

Keywords

  • Neural machine translation
  • constituent alignment
  • discriminator-generator framework
  • syntactic constituent

Fingerprint

Dive into the research topics of 'Improving Neural Machine Translation by Transferring Knowledge from Syntactic Constituent Alignment Learning'. Together they form a unique fingerprint.

Cite this