Alleviating repetitive tokens in non-autoregressive machine translation with unlikelihood training

Shuheng Wang, Shumin Shi*, Heyan Huang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

In recent years, significant progress has been made in the field of non-autoregressive machine translation. However, the accuracy of non-autoregressive models still lags behind that of their autoregressive counterparts. This discrepancy can be attributed to the abundance of repetitive tokens in the target sequences generated by non-autoregressive models. In this study, we delve into this phenomenon and propose a novel approach that trains a non-autoregressive model with an unlikelihood loss. We evaluate our method on three widely used benchmark tasks. The experimental results demonstrate that our proposed approach significantly reduces the number of repetitive tokens while improving the overall performance of non-autoregressive machine translation. Compared to the baseline model "Mask-Predict", the average number of repetitions on the IWSLT 14 DE→EN validation set is reduced from 0.48 to 0.17, a remarkable 62% decrease.
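For intuition, unlikelihood training augments the usual maximum-likelihood objective with a term that pushes probability mass away from a set of negative candidate tokens, i.e. L = L_MLE + α · L_UL with L_UL = −Σ_c log(1 − p(c)). The sketch below is a minimal, hypothetical illustration in the spirit of token-level unlikelihood training (Welleck et al., 2019), not the paper's exact formulation: the function name `unlikelihood_loss`, the choice of the adjacent gold token as the negative candidate, and the weight `alpha` are assumptions made purely for illustration.

```python
import torch
import torch.nn.functional as F

def unlikelihood_loss(logits, targets, alpha=1.0, eps=1e-6):
    # logits:  (batch, seq_len, vocab) raw decoder scores
    # targets: (batch, seq_len) gold token ids
    log_probs = F.log_softmax(logits, dim=-1)

    # Likelihood term: standard token-level cross-entropy.
    nll = F.nll_loss(log_probs.transpose(1, 2), targets, reduction="mean")

    # Negative candidates (an illustrative choice): the gold token of the
    # left-neighbouring position, so the model is penalised for assigning
    # probability mass to repeating its neighbour.
    prev = targets[:, :-1].unsqueeze(-1)                   # (batch, seq_len-1, 1)
    p_neg = log_probs[:, 1:, :].exp().gather(2, prev).squeeze(-1)

    # Unlikelihood term: -log(1 - p(c)), clamped for numerical stability.
    ul = -torch.log(torch.clamp(1.0 - p_neg, min=eps)).mean()

    return nll + alpha * ul

# Toy check with random data.
logits = torch.randn(2, 5, 100, requires_grad=True)
targets = torch.randint(0, 100, (2, 5))
unlikelihood_loss(logits, targets).backward()
```

In a non-autoregressive decoder the negative-candidate set would more plausibly be built from the model's own repeated predictions across positions; the neighbouring gold token is used here only to keep the sketch self-contained.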

Original language: English
Pages (from-to): 4681-4688
Number of pages: 8
Journal: Soft Computing
Volume: 28
Issue number: 5
DOI: 10.1007/s00500-023-09490-1
Publication status: Published - Mar 2024

Cite this

Wang, S., Shi, S., & Huang, H. (2024). Alleviating repetitive tokens in non-autoregressive machine translation with unlikelihood training. Soft Computing, 28(5), 4681-4688. https://doi.org/10.1007/s00500-023-09490-1