MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering

Qian Liu; Xiubo Geng; Heyan Huang; Tao Qin; Jie Lu; Daxin Jiang

doi:10.1109/TNNLS.2021.3107029

MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering

Qian Liu, Xiubo Geng^*, Heyan Huang, Tao Qin, Jie Lu, Daxin Jiang^*

^*此作品的通讯作者

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

6 引用（Scopus）

摘要

Deep neural network-based models have achieved great success in extractive question answering. Recently, many works have been proposed to model multistage matching for this task, which usually first retrieve relevant paragraphs or sentences and then extract an answer span from the retrieved results. However, such a pipeline-based approach suffers from the error propagation problem, especially for sentence-level retrieval that is usually difficult to achieve high accuracy due to the severe data imbalance problem. Furthermore, since the paragraph/sentence selector and the answer extractor are closely related, modeling them independently does not fully exploit the power of multistage matching. To solve these problems, we propose a novel end-to-end multigranularity reading comprehension model, which is a unified framework to explicitly model three matching granularities, including paragraph identification, sentence selection, and answer extraction. Our approach has two main advantages. First, the end-to-end approach alleviates the error propagation problem in both the training and inference phases. Second, the shared features in a unified model improve the learning of representations of different matching granularities. We conduct a comprehensive comparison on four large-scale datasets (SQuAD-open, NewsQA, SQuAD 2.0, and SQuAD Adversarial) and verify that the proposed approach outperforms both the vanilla BERT model and existing multistage matching approaches. We also conduct an ablation study and verify the effectiveness of the proposed components in our model structure.

源语言	英语
页（从-至）	2594-2605
页数	12
期刊	IEEE Transactions on Neural Networks and Learning Systems
卷	34
期	5
DOI	https://doi.org/10.1109/TNNLS.2021.3107029
出版状态	已出版 - 1 5月 2023

访问文件

10.1109/TNNLS.2021.3107029

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{92b7a0aec15047e28c34cef56316e189,

title = "MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering",

abstract = "Deep neural network-based models have achieved great success in extractive question answering. Recently, many works have been proposed to model multistage matching for this task, which usually first retrieve relevant paragraphs or sentences and then extract an answer span from the retrieved results. However, such a pipeline-based approach suffers from the error propagation problem, especially for sentence-level retrieval that is usually difficult to achieve high accuracy due to the severe data imbalance problem. Furthermore, since the paragraph/sentence selector and the answer extractor are closely related, modeling them independently does not fully exploit the power of multistage matching. To solve these problems, we propose a novel end-to-end multigranularity reading comprehension model, which is a unified framework to explicitly model three matching granularities, including paragraph identification, sentence selection, and answer extraction. Our approach has two main advantages. First, the end-to-end approach alleviates the error propagation problem in both the training and inference phases. Second, the shared features in a unified model improve the learning of representations of different matching granularities. We conduct a comprehensive comparison on four large-scale datasets (SQuAD-open, NewsQA, SQuAD 2.0, and SQuAD Adversarial) and verify that the proposed approach outperforms both the vanilla BERT model and existing multistage matching approaches. We also conduct an ablation study and verify the effectiveness of the proposed components in our model structure.",

keywords = "Machine reading comprehension (MRC), natural language processing, question answering",

author = "Qian Liu and Xiubo Geng and Heyan Huang and Tao Qin and Jie Lu and Daxin Jiang",

note = "Publisher Copyright: {\textcopyright} 2012 IEEE.",

year = "2023",

month = may,

day = "1",

doi = "10.1109/TNNLS.2021.3107029",

language = "English",

volume = "34",

pages = "2594--2605",

journal = "IEEE Transactions on Neural Networks and Learning Systems",

issn = "2162-237X",

publisher = "IEEE Computational Intelligence Society",

number = "5",

}

TY - JOUR

T1 - MGRC

T2 - An End-to-End Multigranularity Reading Comprehension Model for Question Answering

AU - Liu, Qian

AU - Geng, Xiubo

AU - Huang, Heyan

AU - Qin, Tao

AU - Lu, Jie

AU - Jiang, Daxin

PY - 2023/5/1

Y1 - 2023/5/1

N2 - Deep neural network-based models have achieved great success in extractive question answering. Recently, many works have been proposed to model multistage matching for this task, which usually first retrieve relevant paragraphs or sentences and then extract an answer span from the retrieved results. However, such a pipeline-based approach suffers from the error propagation problem, especially for sentence-level retrieval that is usually difficult to achieve high accuracy due to the severe data imbalance problem. Furthermore, since the paragraph/sentence selector and the answer extractor are closely related, modeling them independently does not fully exploit the power of multistage matching. To solve these problems, we propose a novel end-to-end multigranularity reading comprehension model, which is a unified framework to explicitly model three matching granularities, including paragraph identification, sentence selection, and answer extraction. Our approach has two main advantages. First, the end-to-end approach alleviates the error propagation problem in both the training and inference phases. Second, the shared features in a unified model improve the learning of representations of different matching granularities. We conduct a comprehensive comparison on four large-scale datasets (SQuAD-open, NewsQA, SQuAD 2.0, and SQuAD Adversarial) and verify that the proposed approach outperforms both the vanilla BERT model and existing multistage matching approaches. We also conduct an ablation study and verify the effectiveness of the proposed components in our model structure.

AB - Deep neural network-based models have achieved great success in extractive question answering. Recently, many works have been proposed to model multistage matching for this task, which usually first retrieve relevant paragraphs or sentences and then extract an answer span from the retrieved results. However, such a pipeline-based approach suffers from the error propagation problem, especially for sentence-level retrieval that is usually difficult to achieve high accuracy due to the severe data imbalance problem. Furthermore, since the paragraph/sentence selector and the answer extractor are closely related, modeling them independently does not fully exploit the power of multistage matching. To solve these problems, we propose a novel end-to-end multigranularity reading comprehension model, which is a unified framework to explicitly model three matching granularities, including paragraph identification, sentence selection, and answer extraction. Our approach has two main advantages. First, the end-to-end approach alleviates the error propagation problem in both the training and inference phases. Second, the shared features in a unified model improve the learning of representations of different matching granularities. We conduct a comprehensive comparison on four large-scale datasets (SQuAD-open, NewsQA, SQuAD 2.0, and SQuAD Adversarial) and verify that the proposed approach outperforms both the vanilla BERT model and existing multistage matching approaches. We also conduct an ablation study and verify the effectiveness of the proposed components in our model structure.

KW - Machine reading comprehension (MRC)

KW - natural language processing

KW - question answering

UR - http://www.scopus.com/inward/record.url?scp=85114719216&partnerID=8YFLogxK

U2 - 10.1109/TNNLS.2021.3107029

DO - 10.1109/TNNLS.2021.3107029

M3 - Article

C2 - 34478387

AN - SCOPUS:85114719216

SN - 2162-237X

VL - 34

SP - 2594

EP - 2605

JO - IEEE Transactions on Neural Networks and Learning Systems

JF - IEEE Transactions on Neural Networks and Learning Systems

IS - 5

ER -

MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering

摘要

访问文件

其它文件与链接

指纹

引用此