Abstract
Image classifiers based on deep neural networks are severely vulnerable to deliberately crafted adversarial examples. Designing more effective and efficient adversarial attacks has attracted considerable interest because of its potential contributions to the interpretability of deep learning and the validation of neural networks' robustness. However, current iterative attacks use a fixed step size for each noise-adding step, leaving the effect of a variable step size on model robustness largely unexplored. We prove that, when the upper bound on the noise added to the original image is fixed, the attack becomes more effective if the step size is positively correlated with the gradient obtained at each step by querying the target model. In this paper, we propose Ada-FGSM (Adaptive FGSM), a new iterative attack that adaptively allocates the step size of the noise according to the gradient information at each step. Improvements in attack success rate and in the accuracy drop measured on ImageNet across multiple models confirm the validity of our method. We analyze the iterative attack process by visualizing its trajectory and gradient contours, and further explain the vulnerability of deep neural networks to variable-step-size adversarial examples.
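As a rough illustration of the idea described above, the following PyTorch sketch shows one way an iterative FGSM-style attack could allocate step sizes adaptively from the gradient magnitude observed at each step while keeping the total perturbation inside a fixed L∞ budget. The function name `ada_fgsm_sketch`, the tanh-based scaling rule, and all hyperparameters are assumptions for illustration only; they are not the exact Ada-FGSM algorithm from the paper.

```python
import torch
import torch.nn.functional as F

def ada_fgsm_sketch(model, x, y, eps=8/255, steps=10):
    """Minimal sketch of an adaptive-step iterative FGSM-style attack.

    Assumption: the per-step budget is allocated in proportion to the
    gradient magnitude observed at that step (the paper's exact
    allocation rule may differ). The total L-inf perturbation is kept
    within `eps` by projecting after every step.
    """
    x_adv = x.clone().detach()
    budget_left = eps                                # remaining L-inf budget (heuristic split)
    for t in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]

        # Adaptive step: a larger gradient magnitude claims a larger share
        # of the remaining budget (assumed allocation rule).
        g_norm = grad.abs().mean()
        base = budget_left / (steps - t)             # uniform share of what is left
        step = base * (1.0 + torch.tanh(g_norm))     # scale up when the gradient is large
        step = torch.clamp(step, max=budget_left)
        budget_left = max(budget_left - float(step), 0.0)

        # FGSM-style signed-gradient update, projected back into the eps-ball.
        x_adv = x_adv.detach() + step * grad.sign()
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps)
        x_adv = x_adv.clamp(0.0, 1.0)
    return x_adv
```

In this sketch the budget is split uniformly across the remaining steps and then inflated or deflated by the current gradient magnitude, which realizes the positive correlation between step size and gradient mentioned in the abstract under a fixed total noise bound.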
Original language | English |
---|---|
Article number | 107309 |
Journal | Pattern Recognition |
Volume | 105 |
DOI | |
Publication status | Published - Sep 2020 |