Integrating pronunciation into Chinese-Vietnamese statistical machine translation

Anh Tran Huu, Heyan Huang, Yuhang Guo*, Shumin Shi, Ping Jian

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

Statistical machine translation for low-resource languages suffers from a lack of abundant training corpora. Several methods, such as the use of a pivot language, have been proposed as a bridge for translating from one language to another. However, errors accumulate along such extended translation pipelines. In this paper, we propose an approach to low-resource language translation that exploits the pronunciation correlations between languages. We find that pronunciation features improve translation quality in both the Chinese-Vietnamese and Vietnamese-Chinese directions. Experimental results show that our proposed model yields effective improvements, raising the BLEU (bilingual evaluation understudy) score by up to 1.03.

Original language: English
Pages (from-to): 715-723
Number of pages: 9
Journal: Tsinghua Science and Technology
Volume: 23
Issue number: 6
Publication status: Published - Dec 2018

Keywords

  • Chinese-Vietnamese machine translation
  • Low-resource languages
  • Pronunciation integration
  • Sino-Vietnamese words
