基于词频效应控制的神经机器翻译用词多样性增强方法

Translated title of the contribution: Improving Word-level Diversity in Neural Machine Translation by Controlling the Effects of Word Frequency

Xuewen Shi, Ping Jian*, Yi Kun Tang, Heyan Huang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Neural machine translation (NMT) optimized by maximum likelihood estimation is prone to problems such as unargmaxable tokens or poor accuracy of low-frequency words, which leads to the lack of word-level diversity in the generated translations. The unbalanced distribution of word frequency on the training data is one of the reasons for the above phenomenon. This paper aims to alleviate the above problems by limiting the impact of word frequency on the estimated probability when decoding NMT. Specifically, we adopt a denoising framework of Half-Sibling Regression based on causal inference theory, combined with the adaptive denoising coefficient proposed in this paper to control the effect of word frequency on estimated probability, in order to obtain more accurate model estimated probability, and enrich the diversity of the words used in NMT translations. The experiments in this paper are carried out on four translation tasks representing different resource scales: Uyghur-Chinese, Chinese-English, English-German and English-French. In addition, the proposed method is model-agnostic and interpretable.

Translated title of the contributionImproving Word-level Diversity in Neural Machine Translation by Controlling the Effects of Word Frequency
Original languageChinese (Traditional)
Title of host publicationProceedings of the 22nd Chinese National Conference on Computational Linguistics, CCL 2023
EditorsMaosong Sun, Bing Qin, Xipeng Qiu, Jing Jiang, Xianpei Han
PublisherAssociation for Computational Linguistics (ACL)
Pages64-77
Number of pages14
ISBN (Electronic)9781713876229
Publication statusPublished - 2023
Event22nd Chinese National Conference on Computational Linguistics, CCL 2023 - Harbin, China
Duration: 3 Aug 20235 Aug 2023

Publication series

NameProceedings of the 22nd Chinese National Conference on Computational Linguistics, CCL 2023
Volume1

Conference

Conference22nd Chinese National Conference on Computational Linguistics, CCL 2023
Country/TerritoryChina
CityHarbin
Period3/08/235/08/23

Fingerprint

Dive into the research topics of 'Improving Word-level Diversity in Neural Machine Translation by Controlling the Effects of Word Frequency'. Together they form a unique fingerprint.

Cite this