Improving neural machine translation with sentence alignment learning

Xuewen Shi; Heyan Huang; Ping Jian; Yi Kun Tang

doi:10.1016/j.neucom.2020.05.104

Improving neural machine translation with sentence alignment learning

Xuewen Shi, Heyan Huang, Ping Jian^*, Yi Kun Tang

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

24 Citations (Scopus)

Abstract

Neural machine translation (NMT) optimized by maximum likelihood estimation (MLE) usually lacks the guarantee of translation adequacy. To alleviate this problem, we propose an NMT approach that heightens the adequacy in machine translation by transferring the semantic knowledge from bilingual sentence alignment learning. Specifically, we first design a discriminator that learns to estimate sentence aligning score over translation candidates. The discriminator is constructed by gated self-attention based sentence encoders and trained with an N-pair loss for better capturing lexical evidences from bilingual sentence pairs. Then we propose an adversarial training framework as well as a sentence alignment-aware decoding method for NMT to transfer the discriminator's learned semantic knowledge to NMT models. We conduct our experiments on Chinese → English, Uyghur → Chinese and English → German translation tasks. Experimental results show that our proposed methods outperform baseline NMT models on all these three translation tasks. Further analysis also indicates the characteristics of our approaches and details the semantic knowledge that transfered from the discriminator to the NMT model.

Original language	English
Pages (from-to)	15-26
Number of pages	12
Journal	Neurocomputing
Volume	420
DOIs	https://doi.org/10.1016/j.neucom.2020.05.104
Publication status	Published - 8 Jan 2021

Keywords

Adversarial training
Neural machine translation
Sentence alignment

Access to Document

10.1016/j.neucom.2020.05.104

Cite this

@article{c23e5c9cbbc549039774dfc819fb3e01,

title = "Improving neural machine translation with sentence alignment learning",

abstract = "Neural machine translation (NMT) optimized by maximum likelihood estimation (MLE) usually lacks the guarantee of translation adequacy. To alleviate this problem, we propose an NMT approach that heightens the adequacy in machine translation by transferring the semantic knowledge from bilingual sentence alignment learning. Specifically, we first design a discriminator that learns to estimate sentence aligning score over translation candidates. The discriminator is constructed by gated self-attention based sentence encoders and trained with an N-pair loss for better capturing lexical evidences from bilingual sentence pairs. Then we propose an adversarial training framework as well as a sentence alignment-aware decoding method for NMT to transfer the discriminator's learned semantic knowledge to NMT models. We conduct our experiments on Chinese → English, Uyghur → Chinese and English → German translation tasks. Experimental results show that our proposed methods outperform baseline NMT models on all these three translation tasks. Further analysis also indicates the characteristics of our approaches and details the semantic knowledge that transfered from the discriminator to the NMT model.",

keywords = "Adversarial training, Neural machine translation, Sentence alignment",

author = "Xuewen Shi and Heyan Huang and Ping Jian and Tang, {Yi Kun}",

note = "Publisher Copyright: {\textcopyright} 2020 Elsevier B.V.",

year = "2021",

month = jan,

day = "8",

doi = "10.1016/j.neucom.2020.05.104",

language = "English",

volume = "420",

pages = "15--26",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Improving neural machine translation with sentence alignment learning

AU - Shi, Xuewen

AU - Huang, Heyan

AU - Jian, Ping

AU - Tang, Yi Kun

PY - 2021/1/8

Y1 - 2021/1/8

N2 - Neural machine translation (NMT) optimized by maximum likelihood estimation (MLE) usually lacks the guarantee of translation adequacy. To alleviate this problem, we propose an NMT approach that heightens the adequacy in machine translation by transferring the semantic knowledge from bilingual sentence alignment learning. Specifically, we first design a discriminator that learns to estimate sentence aligning score over translation candidates. The discriminator is constructed by gated self-attention based sentence encoders and trained with an N-pair loss for better capturing lexical evidences from bilingual sentence pairs. Then we propose an adversarial training framework as well as a sentence alignment-aware decoding method for NMT to transfer the discriminator's learned semantic knowledge to NMT models. We conduct our experiments on Chinese → English, Uyghur → Chinese and English → German translation tasks. Experimental results show that our proposed methods outperform baseline NMT models on all these three translation tasks. Further analysis also indicates the characteristics of our approaches and details the semantic knowledge that transfered from the discriminator to the NMT model.

AB - Neural machine translation (NMT) optimized by maximum likelihood estimation (MLE) usually lacks the guarantee of translation adequacy. To alleviate this problem, we propose an NMT approach that heightens the adequacy in machine translation by transferring the semantic knowledge from bilingual sentence alignment learning. Specifically, we first design a discriminator that learns to estimate sentence aligning score over translation candidates. The discriminator is constructed by gated self-attention based sentence encoders and trained with an N-pair loss for better capturing lexical evidences from bilingual sentence pairs. Then we propose an adversarial training framework as well as a sentence alignment-aware decoding method for NMT to transfer the discriminator's learned semantic knowledge to NMT models. We conduct our experiments on Chinese → English, Uyghur → Chinese and English → German translation tasks. Experimental results show that our proposed methods outperform baseline NMT models on all these three translation tasks. Further analysis also indicates the characteristics of our approaches and details the semantic knowledge that transfered from the discriminator to the NMT model.

KW - Adversarial training

KW - Neural machine translation

KW - Sentence alignment

UR - http://www.scopus.com/inward/record.url?scp=85092107631&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2020.05.104

DO - 10.1016/j.neucom.2020.05.104

M3 - Article

AN - SCOPUS:85092107631

SN - 0925-2312

VL - 420

SP - 15

EP - 26

JO - Neurocomputing

JF - Neurocomputing

ER -

Improving neural machine translation with sentence alignment learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this