跳到主要导航 跳到搜索 跳到主要内容

基于词频效应控制的神经机器翻译用词多样性增强方法

  • Dongbei University of Finance and Economics
  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Neural machine translation (NMT) optimized by maximum likelihood estimation is prone to problems such as unargmaxable tokens or poor accuracy of low-frequency words, which leads to the lack of word-level diversity in the generated translations. The unbalanced distribution of word frequency on the training data is one of the reasons for the above phenomenon. This paper aims to alleviate the above problems by limiting the impact of word frequency on the estimated probability when decoding NMT. Specifically, we adopt a denoising framework of Half-Sibling Regression based on causal inference theory, combined with the adaptive denoising coefficient proposed in this paper to control the effect of word frequency on estimated probability, in order to obtain more accurate model estimated probability, and enrich the diversity of the words used in NMT translations. The experiments in this paper are carried out on four translation tasks representing different resource scales: Uyghur-Chinese, Chinese-English, English-German and English-French. In addition, the proposed method is model-agnostic and interpretable.

投稿的翻译标题Improving Word-level Diversity in Neural Machine Translation by Controlling the Effects of Word Frequency
源语言繁体中文
主期刊名Proceedings of the 22nd Chinese National Conference on Computational Linguistics, CCL 2023
编辑Maosong Sun, Bing Qin, Xipeng Qiu, Jing Jiang, Xianpei Han
出版商Association for Computational Linguistics (ACL)
64-77
页数14
ISBN(电子版)9781713876229
出版状态已出版 - 2023
活动22nd Chinese National Conference on Computational Linguistics, CCL 2023 - Harbin, 中国
期限: 3 8月 20235 8月 2023

出版系列

姓名Proceedings of the 22nd Chinese National Conference on Computational Linguistics, CCL 2023
1

会议

会议22nd Chinese National Conference on Computational Linguistics, CCL 2023
国家/地区中国
Harbin
时期3/08/235/08/23

关键词

  • Causal inference
  • Neural machine translation
  • Translation diversity

指纹

探究 '基于词频效应控制的神经机器翻译用词多样性增强方法' 的科研主题。它们共同构成独一无二的指纹。

引用此