Robustness-Eva-MRC: Assessing and analyzing the robustness of neural models in extractive machine reading comprehension

Jingliang Fang; Hua Xu; Zhijing Wu; Kai Gao; Xiaoyin Che; Haotian Hui

doi:10.1016/j.iswa.2023.200287

Corresponding author for this work: Zhijing Wu

Research output: Contribution to journal › Article › Peer-reviewed

Abstract

Deep neural networks, despite their remarkable success in various language understanding tasks, have been found vulnerable to adversarial attacks and subtle input perturbations, revealing a robustness shortfall. To explore this, this paper presents Robustness-Eva-MRC, an interactive platform designed to assess and analyze the robustness of pre-trained and large-scale language models in extractive machine reading comprehension (MRC) tasks. The platform integrates eight adversarial attack methods across character-, word-, and sentence-levels, and applies them to five MRC datasets, thereby fabricating challenging adversarial testing sets. Then it evaluates the MRC models on both original and adversarial sets, yielding insights into their robustness through performance gaps. Moreover, Robustness-Eva-MRC provides comprehensive visualizations and detailed case studies, enhancing the understanding of model robustness. A screencast video and additional material are available at https://github.com/distantJing/Robustness-Eva-MRC.
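
The performance-gap idea in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the platform's implementation: the abstract does not name its metrics, so the SQuAD-style exact-match score, the function names, and the toy predictions below are all assumptions made for the example.

```python
import re
import string


def normalize(text: str) -> str:
    """Lowercase, strip punctuation and articles, collapse whitespace (SQuAD-style)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(predictions, references) -> float:
    """Percentage of predictions equal to their reference after normalization."""
    hits = sum(normalize(p) == normalize(r) for p, r in zip(predictions, references))
    return 100.0 * hits / len(references)


def robustness_gap(orig_preds, adv_preds, references) -> float:
    """Score on the original set minus score on the adversarial set."""
    return exact_match(orig_preds, references) - exact_match(adv_preds, references)


# Toy data, invented for illustration only (not results from the paper).
references = ["Paris", "1969", "Marie Curie", "the Nile"]
orig_preds = ["Paris", "1969", "Marie Curie", "Nile"]          # answers on original inputs
adv_preds = ["Paris", "1972", "Pierre Curie", "Nile"]          # answers on perturbed inputs

gap = robustness_gap(orig_preds, adv_preds, references)
print(f"EM original:    {exact_match(orig_preds, references):.1f}")
print(f"EM adversarial: {exact_match(adv_preds, references):.1f}")
print(f"Gap:            {gap:.1f}")
```

A larger gap indicates that the model loses more accuracy under perturbation; a gap near zero suggests the attack had little effect on this metric.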

Original language	English
Article number	200287
Journal	Intelligent Systems with Applications
Volume	20
DOI	https://doi.org/10.1016/j.iswa.2023.200287
Publication status	Published - Nov 2023
Externally published	Yes

Cite this

Fang, J., Xu, H., Wu, Z., Gao, K., Che, X., & Hui, H. (2023). Robustness-Eva-MRC: Assessing and analyzing the robustness of neural models in extractive machine reading comprehension. Intelligent Systems with Applications, 20, Article 200287. https://doi.org/10.1016/j.iswa.2023.200287

@article{63824cf2bfde47b18f13d856473d5252,

title = "Robustness-Eva-MRC: Assessing and analyzing the robustness of neural models in extractive machine reading comprehension",

abstract = "Deep neural networks, despite their remarkable success in various language understanding tasks, have been found vulnerable to adversarial attacks and subtle input perturbations, revealing a robustness shortfall. To explore this, this paper presents Robustness-Eva-MRC, an interactive platform designed to assess and analyze the robustness of pre-trained and large-scale language models in extractive machine reading comprehension (MRC) tasks. The platform integrates eight adversarial attack methods across character-, word-, and sentence-levels, and applies them to five MRC datasets, thereby fabricating challenging adversarial testing sets. Then it evaluates the MRC models on both original and adversarial sets, yielding insights into their robustness through performance gaps. Moreover, Robustness-Eva-MRC provides comprehensive visualizations and detailed case studies, enhancing the understanding of model robustness. A screencast video and additional material are available at https://github.com/distantJing/Robustness-Eva-MRC.",

keywords = "Analysis, Extractive machine reading comprehension, Robustness",

author = "Jingliang Fang and Hua Xu and Zhijing Wu and Kai Gao and Xiaoyin Che and Haotian Hui",

note = "Publisher Copyright: {\textcopyright} 2023 The Authors",

year = "2023",

month = nov,

doi = "10.1016/j.iswa.2023.200287",

language = "English",

volume = "20",

journal = "Intelligent Systems with Applications",

issn = "2667-3053",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Robustness-Eva-MRC

T2 - Assessing and analyzing the robustness of neural models in extractive machine reading comprehension

AU - Fang, Jingliang

AU - Xu, Hua

AU - Wu, Zhijing

AU - Gao, Kai

AU - Che, Xiaoyin

AU - Hui, Haotian

PY - 2023/11

Y1 - 2023/11

AB - Deep neural networks, despite their remarkable success in various language understanding tasks, have been found vulnerable to adversarial attacks and subtle input perturbations, revealing a robustness shortfall. To explore this, this paper presents Robustness-Eva-MRC, an interactive platform designed to assess and analyze the robustness of pre-trained and large-scale language models in extractive machine reading comprehension (MRC) tasks. The platform integrates eight adversarial attack methods across character-, word-, and sentence-levels, and applies them to five MRC datasets, thereby fabricating challenging adversarial testing sets. Then it evaluates the MRC models on both original and adversarial sets, yielding insights into their robustness through performance gaps. Moreover, Robustness-Eva-MRC provides comprehensive visualizations and detailed case studies, enhancing the understanding of model robustness. A screencast video and additional material are available at https://github.com/distantJing/Robustness-Eva-MRC.

KW - Analysis

KW - Extractive machine reading comprehension

KW - Robustness

UR - http://www.scopus.com/inward/record.url?scp=85174576085&partnerID=8YFLogxK

U2 - 10.1016/j.iswa.2023.200287

DO - 10.1016/j.iswa.2023.200287

M3 - Article

AN - SCOPUS:85174576085

SN - 2667-3053

VL - 20

JO - Intelligent Systems with Applications

JF - Intelligent Systems with Applications

M1 - 200287

ER -
