Boosting targeted black-box attacks via ensemble substitute training and linear augmentation

Xianfeng Gao; Yu An Tan; Hongwei Jiang; Quanxin Zhang; Xiaohui Kuang

doi:10.3390/app9112286

Boosting targeted black-box attacks via ensemble substitute training and linear augmentation

Xianfeng Gao, Yu An Tan, Hongwei Jiang, Quanxin Zhang, Xiaohui Kuang^*

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

26 引用（Scopus）

摘要

These years, Deep Neural Networks (DNNs) have shown unprecedented performance in many areas. However, some recent studies revealed their vulnerability to small perturbations added on source inputs. Furthermore, we call the ways to generate these perturbations' adversarial attacks, which contain two types, black-box and white-box attacks, according to the adversaries' access to target models. In order to overcome the problem of black-box attackers' unreachabilities to the internals of target DNN, many researchers put forward a series of strategies. Previous works include a method of training a local substitute model for the target black-box model via Jacobian-based augmentation and then use the substitute model to craft adversarial examples using white-box methods. In this work, we improve the dataset augmentation to make the substitute models better fit the decision boundary of the target model. Unlike the previous work that just performed the non-targeted attack, we make it first to generate targeted adversarial examples via training substitute models. Moreover, to boost the targeted attacks, we apply the idea of ensemble attacks to the substitute training. Experiments on MNIST and GTSRB, two common datasets for image classification, demonstrate our effectiveness and efficiency of boosting a targeted black-box attack, and we finally attack the MNIST and GTSRB classifiers with the success rates of 97.7% and 92.8%.

源语言	英语
文章编号	2286
期刊	Applied Sciences (Switzerland)
卷	9
期	11
DOI	https://doi.org/10.3390/app9112286
出版状态	已出版 - 1 6月 2019

访问文件

10.3390/app9112286

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{03e003bb84b44afca8db0dc09ddcf946,

title = "Boosting targeted black-box attacks via ensemble substitute training and linear augmentation",

abstract = "These years, Deep Neural Networks (DNNs) have shown unprecedented performance in many areas. However, some recent studies revealed their vulnerability to small perturbations added on source inputs. Furthermore, we call the ways to generate these perturbations' adversarial attacks, which contain two types, black-box and white-box attacks, according to the adversaries' access to target models. In order to overcome the problem of black-box attackers' unreachabilities to the internals of target DNN, many researchers put forward a series of strategies. Previous works include a method of training a local substitute model for the target black-box model via Jacobian-based augmentation and then use the substitute model to craft adversarial examples using white-box methods. In this work, we improve the dataset augmentation to make the substitute models better fit the decision boundary of the target model. Unlike the previous work that just performed the non-targeted attack, we make it first to generate targeted adversarial examples via training substitute models. Moreover, to boost the targeted attacks, we apply the idea of ensemble attacks to the substitute training. Experiments on MNIST and GTSRB, two common datasets for image classification, demonstrate our effectiveness and efficiency of boosting a targeted black-box attack, and we finally attack the MNIST and GTSRB classifiers with the success rates of 97.7% and 92.8%.",

keywords = "Adversarial attack, Black-box attack, Dataset augmentation, Deep learning, Substitute training",

author = "Xianfeng Gao and Tan, {Yu An} and Hongwei Jiang and Quanxin Zhang and Xiaohui Kuang",

note = "Publisher Copyright: {\textcopyright} 2019 by the authors.",

year = "2019",

month = jun,

day = "1",

doi = "10.3390/app9112286",

language = "English",

volume = "9",

journal = "Applied Sciences (Switzerland)",

issn = "2076-3417",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "11",

}

TY - JOUR

T1 - Boosting targeted black-box attacks via ensemble substitute training and linear augmentation

AU - Gao, Xianfeng

AU - Tan, Yu An

AU - Jiang, Hongwei

AU - Zhang, Quanxin

AU - Kuang, Xiaohui

PY - 2019/6/1

Y1 - 2019/6/1

N2 - These years, Deep Neural Networks (DNNs) have shown unprecedented performance in many areas. However, some recent studies revealed their vulnerability to small perturbations added on source inputs. Furthermore, we call the ways to generate these perturbations' adversarial attacks, which contain two types, black-box and white-box attacks, according to the adversaries' access to target models. In order to overcome the problem of black-box attackers' unreachabilities to the internals of target DNN, many researchers put forward a series of strategies. Previous works include a method of training a local substitute model for the target black-box model via Jacobian-based augmentation and then use the substitute model to craft adversarial examples using white-box methods. In this work, we improve the dataset augmentation to make the substitute models better fit the decision boundary of the target model. Unlike the previous work that just performed the non-targeted attack, we make it first to generate targeted adversarial examples via training substitute models. Moreover, to boost the targeted attacks, we apply the idea of ensemble attacks to the substitute training. Experiments on MNIST and GTSRB, two common datasets for image classification, demonstrate our effectiveness and efficiency of boosting a targeted black-box attack, and we finally attack the MNIST and GTSRB classifiers with the success rates of 97.7% and 92.8%.

AB - These years, Deep Neural Networks (DNNs) have shown unprecedented performance in many areas. However, some recent studies revealed their vulnerability to small perturbations added on source inputs. Furthermore, we call the ways to generate these perturbations' adversarial attacks, which contain two types, black-box and white-box attacks, according to the adversaries' access to target models. In order to overcome the problem of black-box attackers' unreachabilities to the internals of target DNN, many researchers put forward a series of strategies. Previous works include a method of training a local substitute model for the target black-box model via Jacobian-based augmentation and then use the substitute model to craft adversarial examples using white-box methods. In this work, we improve the dataset augmentation to make the substitute models better fit the decision boundary of the target model. Unlike the previous work that just performed the non-targeted attack, we make it first to generate targeted adversarial examples via training substitute models. Moreover, to boost the targeted attacks, we apply the idea of ensemble attacks to the substitute training. Experiments on MNIST and GTSRB, two common datasets for image classification, demonstrate our effectiveness and efficiency of boosting a targeted black-box attack, and we finally attack the MNIST and GTSRB classifiers with the success rates of 97.7% and 92.8%.

KW - Adversarial attack

KW - Black-box attack

KW - Dataset augmentation

KW - Deep learning

KW - Substitute training

UR - http://www.scopus.com/inward/record.url?scp=85067252170&partnerID=8YFLogxK

U2 - 10.3390/app9112286

DO - 10.3390/app9112286

M3 - Article

AN - SCOPUS:85067252170

SN - 2076-3417

VL - 9

JO - Applied Sciences (Switzerland)

JF - Applied Sciences (Switzerland)

IS - 11

M1 - 2286

ER -

Boosting targeted black-box attacks via ensemble substitute training and linear augmentation

摘要

访问文件

其它文件与链接

指纹

引用此