An approach to NMT re-ranking using sequence-labeling for grammatical error correction

Bo Wang, Kaoru Hirota, Chang Liu, Yaping Dai*, Zhiyang Jia

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

5 Citations (Scopus)

Abstract

An approach to N-best hypotheses re-ranking using a sequence-labeling model is applied to resolve the data deficiency problem in Grammatical Error Correction (GEC). Multiple candidate sentences are generated using a Neural Machine Translation (NMT) model; thereafter, these sentences are re-ranked via a stacked Transformer following a Bidirectional Long Short-Term Memory (BiLSTM) with Conditional Random Field (CRF). Correlations within the sentences are extracted using the sequence-labeling model based on the Transformer, which is particularly suitable for long sentences. Meanwhile, the knowledge from a large amount of unlabeled data is acquired through the pre-trained structure. Thus, completely revised sentences are adopted instead of partially modified sentences. Compared with conventional NMT, experiments on the NUCLE and FCE datasets demonstrate that the model improves the F0.5 score by 8.22% and 2.09%, respectively. As an advantage, the proposed re-ranking method requires only a small set of easily computed features and no linguistic input.
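The sketch below illustrates the general N-best re-ranking idea described in the abstract: an NMT correction model proposes several candidate sentences, a sequence-labeling model scores each candidate token by token, and the highest-scoring whole sentence is adopted. All names here (ToyBiLSTMTagger, rerank, the "keep"/"error" tag scheme) are hypothetical; the paper's actual model, a stacked Transformer with BiLSTM-CRF trained on large unlabeled data, is not reproduced.

```python
# Minimal, hypothetical sketch of N-best re-ranking with a sequence-labeling scorer.
# Not the paper's architecture: a toy BiLSTM tagger stands in for the stacked
# Transformer + BiLSTM-CRF, and the tagger here is untrained (demonstration only).
import torch
import torch.nn as nn

class ToyBiLSTMTagger(nn.Module):
    """Tiny BiLSTM tagger emitting per-token 'keep' / 'error' logits."""
    def __init__(self, vocab_size, embed_dim=32, hidden_dim=32, num_tags=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.proj = nn.Linear(2 * hidden_dim, num_tags)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) -> logits: (batch, seq_len, num_tags)
        emb = self.embed(token_ids)
        out, _ = self.lstm(emb)
        return self.proj(out)

def rerank(hypotheses, vocab, tagger):
    """Score each hypothesis by the mean log-probability of the 'keep' tag
    (index 0) over its tokens, and return the candidates best-first."""
    scored = []
    with torch.no_grad():
        for tokens in hypotheses:
            ids = torch.tensor([[vocab[t] for t in tokens]])
            log_probs = tagger(ids).log_softmax(dim=-1)
            score = log_probs[0, :, 0].mean().item()  # average 'keep' confidence
            scored.append((score, tokens))
    return sorted(scored, key=lambda x: x[0], reverse=True)

if __name__ == "__main__":
    # Toy N-best list as it might come from an NMT correction model.
    hyps = [["he", "go", "home"], ["he", "goes", "home"]]
    vocab = {w: i for i, w in enumerate(sorted({t for h in hyps for t in h}))}
    tagger = ToyBiLSTMTagger(vocab_size=len(vocab))
    for score, tokens in rerank(hyps, vocab, tagger):
        print(f"{score:.3f}  {' '.join(tokens)}")
```

Because the scorer judges each candidate sentence as a whole, the selected output is a completely revised sentence rather than a partially modified one, which is the motivation given in the abstract.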

Original language: English
Pages (from-to): 557-567
Number of pages: 11
Journal: Journal of Advanced Computational Intelligence and Intelligent Informatics
Volume: 24
Issue number: 4
DOI
Publication status: Published - July 2020
