Incorporating target language semantic roles into a string-to-tree translation model

Chao Su; Yu hang Guo; He yan Huang; Shu min Shi; Chong Feng

doi:10.1631/FITEE.1601349

Incorporating target language semantic roles into a string-to-tree translation model

Chao Su^*, Yu hang Guo, He yan Huang, Shu min Shi, Chong Feng

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.

Original language	English
Pages (from-to)	1534-1542
Number of pages	9
Journal	Frontiers of Information Technology and Electronic Engineering
Volume	18
Issue number	10
DOIs	https://doi.org/10.1631/FITEE.1601349
Publication status	Published - 1 Oct 2017

Keywords

Machine translation
Semantic role
String-to-tree
Syntax tree

Access to Document

10.1631/FITEE.1601349

Cite this

@article{071f0f9873ed47a883a9aad86407937a,

title = "Incorporating target language semantic roles into a string-to-tree translation model",

abstract = "The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.",

keywords = "Machine translation, Semantic role, String-to-tree, Syntax tree",

author = "Chao Su and Guo, {Yu hang} and Huang, {He yan} and Shi, {Shu min} and Chong Feng",

note = "Publisher Copyright: {\textcopyright} 2017, Zhejiang University and Springer-Verlag GmbH Germany, part of Springer Nature.",

year = "2017",

month = oct,

day = "1",

doi = "10.1631/FITEE.1601349",

language = "English",

volume = "18",

pages = "1534--1542",

journal = "Frontiers of Information Technology and Electronic Engineering",

issn = "2095-9184",

publisher = "Zhejiang University",

number = "10",

}

TY - JOUR

T1 - Incorporating target language semantic roles into a string-to-tree translation model

AU - Su, Chao

AU - Guo, Yu hang

AU - Huang, He yan

AU - Shi, Shu min

AU - Feng, Chong

PY - 2017/10/1

Y1 - 2017/10/1

N2 - The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.

AB - The string-to-tree model is one of the most successful syntax-based statistical machine translation (SMT) models. It models the grammaticality of the output via target-side syntax. However, it does not use any semantic information and tends to produce translations containing semantic role confusions and error chunk sequences. In this paper, we propose two methods to use semantic roles to improve the performance of the string-to-tree translation model: (1) adding role labels in the syntax tree; (2) constructing a semantic role tree, and then incorporating the syntax information into it. We then perform string-to-tree machine translation using the newly generated trees. Our methods enable the system to train and choose better translation rules using semantic information. Our experiments showed significant improvements over the state-of-the-art string-to-tree translation system on both spoken and news corpora, and the two proposed methods surpass the phrase-based system on large-scale training data.

KW - Machine translation

KW - Semantic role

KW - String-to-tree

KW - Syntax tree

UR - http://www.scopus.com/inward/record.url?scp=85038208399&partnerID=8YFLogxK

U2 - 10.1631/FITEE.1601349

DO - 10.1631/FITEE.1601349

M3 - Article

AN - SCOPUS:85038208399

SN - 2095-9184

VL - 18

SP - 1534

EP - 1542

JO - Frontiers of Information Technology and Electronic Engineering

JF - Frontiers of Information Technology and Electronic Engineering

IS - 10

ER -

Incorporating target language semantic roles into a string-to-tree translation model

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this