Improving Non-autoregressive Machine Translation with Soft-Masking

Shuheng Wang, Shumin Shi*, Heyan Huang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

In recent years, non-autoregressive machine translation has achieved great success due to its promising inference speedup. It reduces decoding latency by generating all target words in a single pass. However, a considerable accuracy gap remains between non-autoregressive and autoregressive machine translation. Because it removes the dependencies between target words, non-autoregressive machine translation tends to generate repeated or incorrect words, which degrade translation quality. In this paper, we introduce a soft-masking method to alleviate this issue. Specifically, we introduce an autoregressive discriminator that outputs probabilities indicating which embeddings are correct. According to these probabilities, we apply masks to the copied representations, which enables the model to consider which words are easy to predict. We evaluate our method on three benchmarks: WMT14 EN → DE, WMT16 EN → RO, and IWSLT14 DE → EN. The experimental results demonstrate that our method outperforms the baseline by a large margin at a small cost in speed.
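The abstract does not give the exact masking formula, so the following is only a minimal sketch of one plausible reading: each copied representation is linearly interpolated with a shared mask embedding, weighted by the discriminator's probability that the position is correct. The function name `soft_mask`, the interpolation rule, and the use of a single mask embedding are all assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Sequence

Vector = List[float]


def soft_mask(
    copied: Sequence[Sequence[float]],
    mask_embedding: Sequence[float],
    p_correct: Sequence[float],
) -> List[Vector]:
    """Blend each copied representation with a mask embedding.

    Assumed rule (hypothetical, not taken from the paper):
        out_i = p_i * copied_i + (1 - p_i) * mask_embedding
    where p_i is the discriminator's probability that position i is correct.
    """
    if len(copied) != len(p_correct):
        raise ValueError("need exactly one probability per position")

    blended: List[Vector] = []
    for vec, p in zip(copied, p_correct):
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie in [0, 1]")
        if len(vec) != len(mask_embedding):
            raise ValueError("embedding dimensions must match")
        # High p keeps the copied vector; low p pushes it toward the mask.
        blended.append(
            [p * x + (1.0 - p) * m for x, m in zip(vec, mask_embedding)]
        )
    return blended
```

Under this assumed rule, a position the discriminator trusts fully (probability 1) passes through unchanged, a position it rejects fully (probability 0) is replaced by the mask embedding, and intermediate probabilities yield a mixture of the two.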

Original language: English
Title of host publication: Natural Language Processing and Chinese Computing - 10th CCF International Conference, NLPCC 2021, Proceedings
Editors: Lu Wang, Yansong Feng, Yu Hong, Ruifang He
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 141-152
Number of pages: 12
ISBN (Print): 9783030884796
Publication status: Published - 2021
Event: 10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021 - Qingdao, China
Duration: 13 Oct 2021 to 17 Oct 2021

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13028 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021
Country/Territory: China
City: Qingdao
Period: 13/10/21 to 17/10/21

Keywords

  • Machine translation
  • Non-autoregressive
  • Soft-masking
