TY - JOUR
T1 - Robustness-Eva-MRC
T2 - Assessing and analyzing the robustness of neural models in extractive machine reading comprehension
AU - Fang, Jingliang
AU - Xu, Hua
AU - Wu, Zhijing
AU - Gao, Kai
AU - Che, Xiaoyin
AU - Hui, Haotian
N1 - Publisher Copyright:
© 2023 The Authors
PY - 2023/11
Y1 - 2023/11
N2 - Deep neural networks, despite their remarkable success in various language understanding tasks, have been found vulnerable to adversarial attacks and subtle input perturbations, revealing a robustness shortfall. To explore this, this paper presents Robustness-Eva-MRC, an interactive platform designed to assess and analyze the robustness of pre-trained and large-scale language models in extractive machine reading comprehension (MRC) tasks. The platform integrates eight adversarial attack methods across character-, word-, and sentence-levels, and applies them to five MRC datasets, thereby fabricating challenging adversarial testing sets. Then it evaluates the MRC models on both original and adversarial sets, yielding insights into their robustness through performance gaps. Moreover, Robustness-Eva-MRC provides comprehensive visualizations and detailed case studies, enhancing the understanding of model robustness. A screencast video and additional material are available at https://github.com/distantJing/Robustness-Eva-MRC.
AB - Deep neural networks, despite their remarkable success in various language understanding tasks, have been found vulnerable to adversarial attacks and subtle input perturbations, revealing a robustness shortfall. To explore this, this paper presents Robustness-Eva-MRC, an interactive platform designed to assess and analyze the robustness of pre-trained and large-scale language models in extractive machine reading comprehension (MRC) tasks. The platform integrates eight adversarial attack methods across character-, word-, and sentence-levels, and applies them to five MRC datasets, thereby fabricating challenging adversarial testing sets. Then it evaluates the MRC models on both original and adversarial sets, yielding insights into their robustness through performance gaps. Moreover, Robustness-Eva-MRC provides comprehensive visualizations and detailed case studies, enhancing the understanding of model robustness. A screencast video and additional material are available at https://github.com/distantJing/Robustness-Eva-MRC.
KW - Analysis
KW - Extractive machine reading comprehension
KW - Robustness
UR - http://www.scopus.com/inward/record.url?scp=85174576085&partnerID=8YFLogxK
U2 - 10.1016/j.iswa.2023.200287
DO - 10.1016/j.iswa.2023.200287
M3 - Article
AN - SCOPUS:85174576085
SN - 2667-3053
VL - 20
JO - Intelligent Systems with Applications
JF - Intelligent Systems with Applications
M1 - 200287
ER -