Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation

Tianfu Zhang; Heyan Huang; Chong Feng; Longbing Cao

Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation

Tianfu Zhang, Heyan Huang, Chong Feng^*, Longbing Cao

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Citations (Scopus)

Abstract

While various neural machine translation (NMT) methods have integrated mono-lingual syntax knowledge into the linguistic representation of sequence-to-sequence, no research is available on aligning the syntactic structures of target language with the corresponding source language syntactic structures. This work shows the first attempt of a sourcetarget bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. Building on the word alignment for NMT, our SyntAligner firstly aligns the syntactic structures of source and target sentences and then maximizes their mutual dependency by introducing a lower bound on their mutual information. In SyntAligner, the syntactic structure of span granularity is represented by transforming source or target word hidden state into a source or target syntactic span vector. A border-sensitive span attention mechanism then captures the correlation between the source and target syntactic span vectors, which also captures the self-attention between span border-words as alignment bias. Lastly, a self-supervised bilingual syntactic mutual information maximization-based learning objective dynamically samples the aligned syntactic spans to maximize their mutual dependency. Experiment results on three typical NMT tasks: WMT'14 English!German, IWSLT'14 German!English, and NC'11 English!French show the SyntAligner effectiveness and universality of syntactic alignment.

Original language	English
Title of host publication	35th AAAI Conference on Artificial Intelligence, AAAI 2021
Publisher	Association for the Advancement of Artificial Intelligence
Pages	14454-14462
Number of pages	9
ISBN (Electronic)	9781713835974
Publication status	Published - 2021
Event	35th AAAI Conference on Artificial Intelligence, AAAI 2021 - Virtual, Online Duration: 2 Feb 2021 → 9 Feb 2021

Publication series

Name	35th AAAI Conference on Artificial Intelligence, AAAI 2021
Volume	16

Conference

Conference	35th AAAI Conference on Artificial Intelligence, AAAI 2021
City	Virtual, Online
Period	2/02/21 → 9/02/21

Cite this

@inproceedings{c4e826fbc26844bf84fa3fd53dac7915,

title = "Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation",

abstract = "While various neural machine translation (NMT) methods have integrated mono-lingual syntax knowledge into the linguistic representation of sequence-to-sequence, no research is available on aligning the syntactic structures of target language with the corresponding source language syntactic structures. This work shows the first attempt of a sourcetarget bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. Building on the word alignment for NMT, our SyntAligner firstly aligns the syntactic structures of source and target sentences and then maximizes their mutual dependency by introducing a lower bound on their mutual information. In SyntAligner, the syntactic structure of span granularity is represented by transforming source or target word hidden state into a source or target syntactic span vector. A border-sensitive span attention mechanism then captures the correlation between the source and target syntactic span vectors, which also captures the self-attention between span border-words as alignment bias. Lastly, a self-supervised bilingual syntactic mutual information maximization-based learning objective dynamically samples the aligned syntactic spans to maximize their mutual dependency. Experiment results on three typical NMT tasks: WMT'14 English!German, IWSLT'14 German!English, and NC'11 English!French show the SyntAligner effectiveness and universality of syntactic alignment.",

author = "Tianfu Zhang and Heyan Huang and Chong Feng and Longbing Cao",

note = "Publisher Copyright: Copyright {\textcopyright} 2021, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.; 35th AAAI Conference on Artificial Intelligence, AAAI 2021 ; Conference date: 02-02-2021 Through 09-02-2021",

year = "2021",

language = "English",

series = "35th AAAI Conference on Artificial Intelligence, AAAI 2021",

publisher = "Association for the Advancement of Artificial Intelligence",

pages = "14454--14462",

booktitle = "35th AAAI Conference on Artificial Intelligence, AAAI 2021",

}

Zhang, T, Huang, H , Feng, C & Cao, L 2021, Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation. in 35th AAAI Conference on Artificial Intelligence, AAAI 2021. 35th AAAI Conference on Artificial Intelligence, AAAI 2021, vol. 16, Association for the Advancement of Artificial Intelligence, pp. 14454-14462, 35th AAAI Conference on Artificial Intelligence, AAAI 2021, Virtual, Online, 2/02/21.

Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation. / Zhang, Tianfu; Huang, Heyan ; Feng, Chong et al.
35th AAAI Conference on Artificial Intelligence, AAAI 2021. Association for the Advancement of Artificial Intelligence, 2021. p. 14454-14462 (35th AAAI Conference on Artificial Intelligence, AAAI 2021; Vol. 16).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation

AU - Zhang, Tianfu

AU - Huang, Heyan

AU - Feng, Chong

AU - Cao, Longbing

PY - 2021

Y1 - 2021

N2 - While various neural machine translation (NMT) methods have integrated mono-lingual syntax knowledge into the linguistic representation of sequence-to-sequence, no research is available on aligning the syntactic structures of target language with the corresponding source language syntactic structures. This work shows the first attempt of a sourcetarget bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. Building on the word alignment for NMT, our SyntAligner firstly aligns the syntactic structures of source and target sentences and then maximizes their mutual dependency by introducing a lower bound on their mutual information. In SyntAligner, the syntactic structure of span granularity is represented by transforming source or target word hidden state into a source or target syntactic span vector. A border-sensitive span attention mechanism then captures the correlation between the source and target syntactic span vectors, which also captures the self-attention between span border-words as alignment bias. Lastly, a self-supervised bilingual syntactic mutual information maximization-based learning objective dynamically samples the aligned syntactic spans to maximize their mutual dependency. Experiment results on three typical NMT tasks: WMT'14 English!German, IWSLT'14 German!English, and NC'11 English!French show the SyntAligner effectiveness and universality of syntactic alignment.

AB - While various neural machine translation (NMT) methods have integrated mono-lingual syntax knowledge into the linguistic representation of sequence-to-sequence, no research is available on aligning the syntactic structures of target language with the corresponding source language syntactic structures. This work shows the first attempt of a sourcetarget bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. Building on the word alignment for NMT, our SyntAligner firstly aligns the syntactic structures of source and target sentences and then maximizes their mutual dependency by introducing a lower bound on their mutual information. In SyntAligner, the syntactic structure of span granularity is represented by transforming source or target word hidden state into a source or target syntactic span vector. A border-sensitive span attention mechanism then captures the correlation between the source and target syntactic span vectors, which also captures the self-attention between span border-words as alignment bias. Lastly, a self-supervised bilingual syntactic mutual information maximization-based learning objective dynamically samples the aligned syntactic spans to maximize their mutual dependency. Experiment results on three typical NMT tasks: WMT'14 English!German, IWSLT'14 German!English, and NC'11 English!French show the SyntAligner effectiveness and universality of syntactic alignment.

UR - http://www.scopus.com/inward/record.url?scp=85122739518&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85122739518

T3 - 35th AAAI Conference on Artificial Intelligence, AAAI 2021

SP - 14454

EP - 14462

BT - 35th AAAI Conference on Artificial Intelligence, AAAI 2021

PB - Association for the Advancement of Artificial Intelligence

T2 - 35th AAAI Conference on Artificial Intelligence, AAAI 2021

Y2 - 2 February 2021 through 9 February 2021

ER -

Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation

Abstract

Publication series

Conference

Other files and links

Fingerprint

Cite this