Case-Sensitive Neural Machine Translation

Xuewen Shi; Heyan Huang; Ping Jian; Yi Kun Tang

doi:10.1007/978-3-030-47426-3_51

Case-Sensitive Neural Machine Translation

Xuewen Shi, Heyan Huang, Ping Jian^*, Yi Kun Tang

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

2 引用（Scopus）

摘要

Even as an important lexical information for Latin languages, word case is often ignored in machine translation. According to observations, the translation performance drops significantly when we introduce case-sensitive evaluation metrics. In this paper, we introduce two types of case-sensitive neural machine translation (NMT) approaches to alleviate the above problems: i) adding case tokens into the decoding sequence, and ii) adopting case prediction to the conventional NMT. Our proposed approaches incorporate case information to the NMT decoder by jointly learning target word generation and word case prediction. We compare our approaches with multiple kinds of baselines including NMT with naive case-restoration methods and analyze the impacts of various setups on our approaches. Experimental results on three typical translation tasks (Zh-En, En-Fr, En-De) show that our proposed methods lead to the improvements up to 2.5, 1.0 and 0.5 in case-sensitive BLEU scores respectively. Further analyses also illustrate the inherent reasons why our approaches lead to different improvements on different translation tasks.

源语言	英语
主期刊名	Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings
编辑	Hady W. Lauw, Ee-Peng Lim, Raymond Chi-Wing Wong, Alexandros Ntoulas, See-Kiong Ng, Sinno Jialin Pan
出版商	Springer
页	662-674
页数	13
ISBN（印刷版）	9783030474256
DOI	https://doi.org/10.1007/978-3-030-47426-3_51
出版状态	已出版 - 2020
活动	24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020 - Singapore, 新加坡期限: 11 5月 2020 → 14 5月 2020

出版系列

姓名	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
卷	12084 LNAI
ISSN（印刷版）	0302-9743
ISSN（电子版）	1611-3349

会议

会议	24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020
国家/地区	新加坡
市	Singapore
时期	11/05/20 → 14/05/20

访问文件

10.1007/978-3-030-47426-3_51

其它文件与链接

链接到 Scopus 的出版物

引用此

Shi, X., Huang, H., Jian, P., & Tang, Y. K. (2020). Case-Sensitive Neural Machine Translation. 在 H. W. Lauw, E.-P. Lim, R. C.-W. Wong, A. Ntoulas, S.-K. Ng, & S. J. Pan (编辑), Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings (页码 662-674). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 12084 LNAI). Springer. https://doi.org/10.1007/978-3-030-47426-3_51

Shi, Xuewen ; Huang, Heyan ; Jian, Ping 等. / Case-Sensitive Neural Machine Translation. Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings. 编辑 / Hady W. Lauw ; Ee-Peng Lim ; Raymond Chi-Wing Wong ; Alexandros Ntoulas ; See-Kiong Ng ; Sinno Jialin Pan. Springer, 2020. 页码 662-674 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{6462403279e042d69c71adacc2a9b44c,

title = "Case-Sensitive Neural Machine Translation",

abstract = "Even as an important lexical information for Latin languages, word case is often ignored in machine translation. According to observations, the translation performance drops significantly when we introduce case-sensitive evaluation metrics. In this paper, we introduce two types of case-sensitive neural machine translation (NMT) approaches to alleviate the above problems: i) adding case tokens into the decoding sequence, and ii) adopting case prediction to the conventional NMT. Our proposed approaches incorporate case information to the NMT decoder by jointly learning target word generation and word case prediction. We compare our approaches with multiple kinds of baselines including NMT with naive case-restoration methods and analyze the impacts of various setups on our approaches. Experimental results on three typical translation tasks (Zh-En, En-Fr, En-De) show that our proposed methods lead to the improvements up to 2.5, 1.0 and 0.5 in case-sensitive BLEU scores respectively. Further analyses also illustrate the inherent reasons why our approaches lead to different improvements on different translation tasks.",

keywords = "Case-sensitive, Natural language processing, Neural machine translation",

author = "Xuewen Shi and Heyan Huang and Ping Jian and Tang, {Yi Kun}",

note = "Publisher Copyright: {\textcopyright} Springer Nature Switzerland AG 2020.; 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020 ; Conference date: 11-05-2020 Through 14-05-2020",

year = "2020",

doi = "10.1007/978-3-030-47426-3_51",

language = "English",

isbn = "9783030474256",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "662--674",

editor = "Lauw, {Hady W.} and Ee-Peng Lim and Wong, {Raymond Chi-Wing} and Alexandros Ntoulas and See-Kiong Ng and Pan, {Sinno Jialin}",

booktitle = "Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings",

address = "Germany",

}

Shi, X, Huang, H, Jian, P & Tang, YK 2020, Case-Sensitive Neural Machine Translation. 在 HW Lauw, E-P Lim, RC-W Wong, A Ntoulas, S-K Ng & SJ Pan (编辑), Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 卷 12084 LNAI, Springer, 页码 662-674, 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020, Singapore, 新加坡, 11/05/20. https://doi.org/10.1007/978-3-030-47426-3_51

Case-Sensitive Neural Machine Translation. / Shi, Xuewen; Huang, Heyan; Jian, Ping 等.
Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings. 编辑 / Hady W. Lauw; Ee-Peng Lim; Raymond Chi-Wing Wong; Alexandros Ntoulas; See-Kiong Ng; Sinno Jialin Pan. Springer, 2020. 页码 662-674 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 12084 LNAI).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Case-Sensitive Neural Machine Translation

AU - Shi, Xuewen

AU - Huang, Heyan

AU - Jian, Ping

AU - Tang, Yi Kun

N1 - Publisher Copyright: © Springer Nature Switzerland AG 2020.

PY - 2020

Y1 - 2020

N2 - Even as an important lexical information for Latin languages, word case is often ignored in machine translation. According to observations, the translation performance drops significantly when we introduce case-sensitive evaluation metrics. In this paper, we introduce two types of case-sensitive neural machine translation (NMT) approaches to alleviate the above problems: i) adding case tokens into the decoding sequence, and ii) adopting case prediction to the conventional NMT. Our proposed approaches incorporate case information to the NMT decoder by jointly learning target word generation and word case prediction. We compare our approaches with multiple kinds of baselines including NMT with naive case-restoration methods and analyze the impacts of various setups on our approaches. Experimental results on three typical translation tasks (Zh-En, En-Fr, En-De) show that our proposed methods lead to the improvements up to 2.5, 1.0 and 0.5 in case-sensitive BLEU scores respectively. Further analyses also illustrate the inherent reasons why our approaches lead to different improvements on different translation tasks.

AB - Even as an important lexical information for Latin languages, word case is often ignored in machine translation. According to observations, the translation performance drops significantly when we introduce case-sensitive evaluation metrics. In this paper, we introduce two types of case-sensitive neural machine translation (NMT) approaches to alleviate the above problems: i) adding case tokens into the decoding sequence, and ii) adopting case prediction to the conventional NMT. Our proposed approaches incorporate case information to the NMT decoder by jointly learning target word generation and word case prediction. We compare our approaches with multiple kinds of baselines including NMT with naive case-restoration methods and analyze the impacts of various setups on our approaches. Experimental results on three typical translation tasks (Zh-En, En-Fr, En-De) show that our proposed methods lead to the improvements up to 2.5, 1.0 and 0.5 in case-sensitive BLEU scores respectively. Further analyses also illustrate the inherent reasons why our approaches lead to different improvements on different translation tasks.

KW - Case-sensitive

KW - Natural language processing

KW - Neural machine translation

UR - http://www.scopus.com/inward/record.url?scp=85085732838&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-47426-3_51

DO - 10.1007/978-3-030-47426-3_51

M3 - Conference contribution

AN - SCOPUS:85085732838

SN - 9783030474256

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 662

EP - 674

BT - Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings

A2 - Lauw, Hady W.

A2 - Lim, Ee-Peng

A2 - Wong, Raymond Chi-Wing

A2 - Ntoulas, Alexandros

A2 - Ng, See-Kiong

A2 - Pan, Sinno Jialin

PB - Springer

T2 - 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020

Y2 - 11 May 2020 through 14 May 2020

ER -

Shi X, Huang H, Jian P, Tang YK. Case-Sensitive Neural Machine Translation. 在 Lauw HW, Lim EP, Wong RCW, Ntoulas A, Ng SK, Pan SJ, 编辑, Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference, PAKDD 2020, Proceedings. Springer. 2020. 页码 662-674. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-47426-3_51

Case-Sensitive Neural Machine Translation

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此