Incorporating target language semantic roles into a string-to-tree translation model

Chao Su; Yu hang Guo; He yan Huang; Shu min Shi; Chong Feng

doi:10.1631/FITEE.1601349

Chao Su^*, Yu hang Guo, He yan Huang, Shu min Shi, Chong Feng

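^*Corresponding author for this work

School of Computer Science

Research output: Contribution to journal › Article › Peer-reviewed

1 citation (Scopus)

Abstract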

The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.

Original language	English
Pages (from-to)	1534-1542
Number of pages	9
Journal	Frontiers of Information Technology and Electronic Engineering
Volume	18
Issue	10
DOI	https://doi.org/10.1631/FITEE.1601349
Publication status	Published - 1 Oct 2017

Cite this

@article{071f0f9873ed47a883a9aad86407937a,
  title = "Incorporating target language semantic roles into a string-to-tree translation model",
  abstract = "The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.",
  keywords = "Machine translation, Semantic role, String-to-tree, Syntax tree",
  author = "Chao Su and Guo, {Yu hang} and Huang, {He yan} and Shi, {Shu min} and Chong Feng",
  note = "Publisher Copyright: {\textcopyright} 2017, Zhejiang University and Springer-Verlag GmbH Germany, part of Springer Nature.",
  year = "2017",
  month = oct,
  day = "1",
  doi = "10.1631/FITEE.1601349",
  language = "English",
  volume = "18",
  pages = "1534--1542",
  journal = "Frontiers of Information Technology and Electronic Engineering",
  issn = "2095-9184",
  publisher = "Zhejiang University",
  number = "10",
}

TY - JOUR
T1 - Incorporating target language semantic roles into a string-to-tree translation model
AU - Su, Chao
AU - Guo, Yu hang
AU - Huang, He yan
AU - Shi, Shu min
AU - Feng, Chong
PY - 2017/10/1
Y1 - 2017/10/1
N2 - The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.

KW - Machine translation
KW - Semantic role
KW - String-to-tree
KW - Syntax tree
UR - http://www.scopus.com/inward/record.url?scp=85038208399&partnerID=8YFLogxK
U2 - 10.1631/FITEE.1601349
DO - 10.1631/FITEE.1601349
M3 - Article
AN - SCOPUS:85038208399
SN - 2095-9184
VL - 18
SP - 1534
EP - 1542
JO - Frontiers of Information Technology and Electronic Engineering
JF - Frontiers of Information Technology and Electronic Engineering
IS - 10
ER -
