Improving Non-autoregressive Machine Translation with Soft-Masking

Shuheng Wang, Shumin Shi*, Heyan Huang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

In recent years, non-autoregressive machine translation has achieved great success due to its promising inference speedup. Non-autoregressive machine translation reduces the decoding latency by generating the target words in single-pass. However, there is a considerable gap in the accuracy between non-autoregressive machine translation and autoregressive machine translation. Because it removes the dependencies between the target words, non-autoregressive machine translation tends to generate repetitive words or wrong words, and these repetitive or wrong words lead to low performance. In this paper, we introduce a soft-masking method to alleviate this issue. Specifically, we introduce an autoregressive discriminator, which will output the probabilities hinting which embeddings are correct. Then according to the probabilities, we add mask on the copied representations, which enables the model to consider which words are easy to be predicted. We evaluated our method on three benchmarks, including WMT14 EN → DE, WMT16 EN → RO, and IWSLT14 DE → EN. The experimental results demonstrate that our method can outperform the baseline by a large margin with a bit of speed sacrifice.

源语言英语
主期刊名Natural Language Processing and Chinese Computing - 10th CCF International Conference, NLPCC 2021, Proceedings
编辑Lu Wang, Yansong Feng, Yu Hong, Ruifang He
出版商Springer Science and Business Media Deutschland GmbH
141-152
页数12
ISBN(印刷版)9783030884796
DOI
出版状态已出版 - 2021
活动10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021 - Qingdao, 中国
期限: 13 10月 202117 10月 2021

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13028 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议10th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2021
国家/地区中国
Qingdao
时期13/10/2117/10/21

指纹

探究 'Improving Non-autoregressive Machine Translation with Soft-Masking' 的科研主题。它们共同构成独一无二的指纹。

引用此