An approach to NMT re-ranking using sequence-labeling for grammatical error correction

Bo Wang, Kaoru Hirota, Chang Liu, Yaping Dai*, Zhiyang Jia

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

5 Citations (Scopus)

Abstract

An approach to N-best hypothesis re-ranking using a sequence-labeling model is applied to resolve the data deficiency problem in Grammatical Error Correction (GEC). Multiple candidate sentences are generated by a Neural Machine Translation (NMT) model; these sentences are then re-ranked via a stacked Transformer followed by a Bidirectional Long Short-Term Memory (BiLSTM) with a Conditional Random Field (CRF). Correlations within the sentences are extracted by the Transformer-based sequence-labeling model, which is particularly suitable for long sentences, while knowledge from a large amount of unlabeled data is acquired through the pre-trained structure. Completely revised sentences are thus adopted instead of partially modified ones. Compared with conventional NMT, experiments on the NUCLE and FCE datasets demonstrate that the model improves the F0.5 score by 8.22% and 2.09%, respectively. The proposed re-ranking method has the advantage of requiring only a small set of easily computed features and no linguistic inputs.
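To make the re-ranking pipeline concrete, the following is a minimal sketch in Python. The names generate_nbest, label_score, toy_generate, and toy_score are hypothetical stand-ins introduced here for illustration; the paper's actual components are an NMT model for candidate generation and a stacked Transformer/BiLSTM-CRF sequence-labeling model for scoring.

    # Minimal sketch of N-best hypothesis re-ranking for GEC.
    # generate_nbest and label_score are hypothetical stand-ins,
    # not the paper's models.
    from typing import Callable, List

    def rerank(source: str,
               generate_nbest: Callable[[str, int], List[str]],
               label_score: Callable[[str, str], float],
               n: int = 5) -> str:
        """Generate N candidate corrections, score each candidate with
        the sequence-labeling model, and return the highest-scoring one
        as a completely revised sentence."""
        candidates = generate_nbest(source, n)
        return max(candidates, key=lambda hyp: label_score(source, hyp))

    # Toy stand-ins so the sketch runs end to end (illustrative only).
    def toy_generate(source: str, n: int) -> List[str]:
        # A real NMT decoder would return its N-best beam hypotheses.
        return [source, source.replace("have", "has")][:n]

    def toy_score(source: str, hypothesis: str) -> float:
        # A real scorer would aggregate per-token label scores from the
        # Transformer/BiLSTM-CRF labeler; here we just favor any edit.
        return float(len(hypothesis) != len(source))

    print(rerank("He have a car.", toy_generate, toy_score))

In this sketch the re-ranker only needs per-candidate scores computed from the sentences themselves, mirroring the paper's point that no external linguistic inputs are required.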

Original language: English
Pages (from-to): 557-567
Number of pages: 11
Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics
Volume: 24
Issue number: 4
DOIs
Publication status: Published - Jul 2020

Keywords

  • Grammatical error correction
  • Neural machine translation
  • Sequence-labeling
  • Transformer
