TY - JOUR
T1 - Integrating pronunciation into Chinese-Vietnamese statistical machine translation
AU - Huu, Anh Tran
AU - Huang, Heyan
AU - Guo, Yuhang
AU - Shi, Shumin
AU - Jian, Ping
N1 - Publisher Copyright:
© 2018 Tsinghua University Press. All rights reserved.
PY - 2018/12
Y1 - 2018/12
N2 - : Statistical machine translation for low-resource language suffers from the lack of abundant training corpora. Several methods, such as the use of a pivot language, have been proposed as a bridge to translate from one language to another. However, errors will accumulate during the extensive translation pipelines. In this paper, we propose an approach to low-resource language translation by exploiting the pronunciation correlations between languages. We find that the pronunciation features can improve both Chinese-Vietnamese and Vietnamese-Chinese translation qualities. Experimental results show that our proposed model yields effective improvements, and the translation performance (bilingual evaluation understudy score) is improved by a maximum value of 1.03.
AB - : Statistical machine translation for low-resource language suffers from the lack of abundant training corpora. Several methods, such as the use of a pivot language, have been proposed as a bridge to translate from one language to another. However, errors will accumulate during the extensive translation pipelines. In this paper, we propose an approach to low-resource language translation by exploiting the pronunciation correlations between languages. We find that the pronunciation features can improve both Chinese-Vietnamese and Vietnamese-Chinese translation qualities. Experimental results show that our proposed model yields effective improvements, and the translation performance (bilingual evaluation understudy score) is improved by a maximum value of 1.03.
KW - Chinese-Vietnamese machine translation
KW - Low-resource languages
KW - Pronunciation integration
KW - Sino-Vietnamese words
UR - http://www.scopus.com/inward/record.url?scp=85055853297&partnerID=8YFLogxK
U2 - 10.26599/TST.2018.9010006
DO - 10.26599/TST.2018.9010006
M3 - Article
AN - SCOPUS:85055853297
SN - 1007-0214
VL - 23
SP - 715
EP - 723
JO - Tsinghua Science and Technology
JF - Tsinghua Science and Technology
IS - 6
ER -