Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation

Tianfu Zhang, Heyan Huang, Chong Feng*, Longbing Cao

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

6 引用 (Scopus)

摘要

While various neural machine translation (NMT) methods have integrated mono-lingual syntax knowledge into the linguistic representation of sequence-to-sequence, no research is available on aligning the syntactic structures of target language with the corresponding source language syntactic structures. This work shows the first attempt of a sourcetarget bilingual syntactic alignment approach SyntAligner by mutual information maximization-based self-supervised neural deep modeling. Building on the word alignment for NMT, our SyntAligner firstly aligns the syntactic structures of source and target sentences and then maximizes their mutual dependency by introducing a lower bound on their mutual information. In SyntAligner, the syntactic structure of span granularity is represented by transforming source or target word hidden state into a source or target syntactic span vector. A border-sensitive span attention mechanism then captures the correlation between the source and target syntactic span vectors, which also captures the self-attention between span border-words as alignment bias. Lastly, a self-supervised bilingual syntactic mutual information maximization-based learning objective dynamically samples the aligned syntactic spans to maximize their mutual dependency. Experiment results on three typical NMT tasks: WMT'14 English!German, IWSLT'14 German!English, and NC'11 English!French show the SyntAligner effectiveness and universality of syntactic alignment.

源语言英语
主期刊名35th AAAI Conference on Artificial Intelligence, AAAI 2021
出版商Association for the Advancement of Artificial Intelligence
14454-14462
页数9
ISBN(电子版)9781713835974
出版状态已出版 - 2021
活动35th AAAI Conference on Artificial Intelligence, AAAI 2021 - Virtual, Online
期限: 2 2月 20219 2月 2021

出版系列

姓名35th AAAI Conference on Artificial Intelligence, AAAI 2021
16

会议

会议35th AAAI Conference on Artificial Intelligence, AAAI 2021
Virtual, Online
时期2/02/219/02/21

指纹

探究 'Self-supervised Bilingual Syntactic Alignment for Neural Machine Translation' 的科研主题。它们共同构成独一无二的指纹。

引用此