An approach to NMT re-ranking using sequence-labeling for grammatical error correction

Bo Wang, Kaoru Hirota, Chang Liu, Yaping Dai*, Zhiyang Jia

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

5 Citations (Scopus)

Abstract

An approach to N-best hypotheses re-ranking using a sequence-labeling model is applied to resolve the data deficiency problem in Grammatical Error Correction (GEC). Multiple candidate sentences are generated using a Neural Machine Translation (NMT) model; thereafter, these sentences are re-ranked via a stacked Transformer following a Bidirectional Long Short-Term Memory (BiLSTM) with Conditional Random Field (CRF). Correlations within the sentences are extracted using the sequence-labeling model based on the Transformer, which is particularly suitable for long sentences. Meanwhile, the knowledge from a large amount of unlabeled data is acquired through the pre-trained structure. Thus, completely revised sentences are adopted instead of partially modified sentences. Compared with conventional NMT, experiments on the NUCLE and FCE datasets demonstrate that the model improves the F0.5 score by 8.22% and 2.09%, respectively. As an advantage, the proposed re-ranking method requires only a small set of easily computed features and no linguistic input.
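The sketch below illustrates the general N-best re-ranking idea described in the abstract: an NMT correction model proposes several candidate sentences, a sequence-labeling model scores each candidate token by token, and the highest-scoring whole sentence is adopted. All names here (ToyBiLSTMTagger, rerank, the "keep"/"error" tag scheme) are hypothetical; the paper's actual model, a stacked Transformer with BiLSTM-CRF trained on large unlabeled data, is not reproduced.

```python
# Minimal, hypothetical sketch of N-best re-ranking with a sequence-labeling scorer.
# Not the paper's architecture: a toy BiLSTM tagger stands in for the stacked
# Transformer + BiLSTM-CRF, and the tagger here is untrained (demonstration only).
import torch
import torch.nn as nn

class ToyBiLSTMTagger(nn.Module):
    """Tiny BiLSTM tagger emitting per-token 'keep' / 'error' logits."""
    def __init__(self, vocab_size, embed_dim=32, hidden_dim=32, num_tags=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.proj = nn.Linear(2 * hidden_dim, num_tags)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) -> logits: (batch, seq_len, num_tags)
        emb = self.embed(token_ids)
        out, _ = self.lstm(emb)
        return self.proj(out)

def rerank(hypotheses, vocab, tagger):
    """Score each hypothesis by the mean log-probability of the 'keep' tag
    (index 0) over its tokens, and return the candidates best-first."""
    scored = []
    with torch.no_grad():
        for tokens in hypotheses:
            ids = torch.tensor([[vocab[t] for t in tokens]])
            log_probs = tagger(ids).log_softmax(dim=-1)
            score = log_probs[0, :, 0].mean().item()  # average 'keep' confidence
            scored.append((score, tokens))
    return sorted(scored, key=lambda x: x[0], reverse=True)

if __name__ == "__main__":
    # Toy N-best list as it might come from an NMT correction model.
    hyps = [["he", "go", "home"], ["he", "goes", "home"]]
    vocab = {w: i for i, w in enumerate(sorted({t for h in hyps for t in h}))}
    tagger = ToyBiLSTMTagger(vocab_size=len(vocab))
    for score, tokens in rerank(hyps, vocab, tagger):
        print(f"{score:.3f}  {' '.join(tokens)}")
```

Because the scorer judges each candidate sentence as a whole, the selected output is a completely revised sentence rather than a partially modified one, which is the motivation given in the abstract.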

Original language: English
Pages (from-to): 557-567
Number of pages: 11
Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics
Volume: 24
Issue number: 4
DOI
Publication status: Published - July 2020
