A multitarget backdooring attack on deep neural networks with random location trigger

Yu Xiao; Liu Cong; Zheng Mingwen; Wang Yajie; Liu Xinrui; Song Shuxiao; Ma Yuexuan; Zheng Jun

doi:10.1002/int.22785

A multitarget backdooring attack on deep neural networks with random location trigger

Yu Xiao, Liu Cong, Zheng Mingwen, Wang Yajie, Liu Xinrui, Song Shuxiao, Ma Yuexuan, Zheng Jun^*

^*此作品的通讯作者

网络空间安全学院

科研成果: 期刊稿件 › 文章 › 同行评审

14 引用（Scopus）

摘要

Machine learning has made tremendous progress and applied to various critical practical applications. However, recent studies have shown that machine learning models are vulnerable to malicious attackers, such as neural network backdoor triggering. A successful backdoor triggering behavior may cause serious consequences, such as allowing the attacker to bypass the identity verification and directly enter the system. In image classification, there is always only one target label triggered by one backdoor trigger in previous works. The position of the backdoor trigger is also fixed, which brings limitations to the attack. In this paper, we propose a novel method that utilizes one trigger pattern to correspond to multiple target labels, and the location of the trigger is not limited. In our method, the trigger guarantees that the malicious output is within the range of multiple targets chosen by the attacker, but the specific target depends on the original image where the trigger is pasted. Due to the original images' diversity, it is difficult for the defender to predict which target the image with the trigger is classified as. Besides, the attacker can use only one trigger pattern to achieve multitarget attacks at different locations, which brings more flexibility. We also proposed to train a neural network as a detector to distinguish backdoor images and clean images for multitarget backdooring attacks. Experiment results show that the detection method can also successfully detect the backdoor image with a trigger at a random location of the image, and the detection success rate is as high as 86.02%.

源语言	英语
页（从-至）	2567-2583
页数	17
期刊	International Journal of Intelligent Systems
卷	37
期	3
DOI	https://doi.org/10.1002/int.22785
出版状态	已出版 - 3月 2022

访问文件

10.1002/int.22785

其它文件与链接

链接到 Scopus 的出版物

引用此

Xiao, Y., Cong, L., Mingwen, Z., Yajie, W., Xinrui, L., Shuxiao, S., Yuexuan, M., & Jun, Z. (2022). A multitarget backdooring attack on deep neural networks with random location trigger. International Journal of Intelligent Systems, 37(3), 2567-2583. https://doi.org/10.1002/int.22785

@article{de1a6edef8e6444486237e81664bafe7,

title = "A multitarget backdooring attack on deep neural networks with random location trigger",

abstract = "Machine learning has made tremendous progress and applied to various critical practical applications. However, recent studies have shown that machine learning models are vulnerable to malicious attackers, such as neural network backdoor triggering. A successful backdoor triggering behavior may cause serious consequences, such as allowing the attacker to bypass the identity verification and directly enter the system. In image classification, there is always only one target label triggered by one backdoor trigger in previous works. The position of the backdoor trigger is also fixed, which brings limitations to the attack. In this paper, we propose a novel method that utilizes one trigger pattern to correspond to multiple target labels, and the location of the trigger is not limited. In our method, the trigger guarantees that the malicious output is within the range of multiple targets chosen by the attacker, but the specific target depends on the original image where the trigger is pasted. Due to the original images' diversity, it is difficult for the defender to predict which target the image with the trigger is classified as. Besides, the attacker can use only one trigger pattern to achieve multitarget attacks at different locations, which brings more flexibility. We also proposed to train a neural network as a detector to distinguish backdoor images and clean images for multitarget backdooring attacks. Experiment results show that the detection method can also successfully detect the backdoor image with a trigger at a random location of the image, and the detection success rate is as high as 86.02%.",

author = "Yu Xiao and Liu Cong and Zheng Mingwen and Wang Yajie and Liu Xinrui and Song Shuxiao and Ma Yuexuan and Zheng Jun",

note = "Publisher Copyright: {\textcopyright} 2021 Wiley Periodicals LLC.",

year = "2022",

month = mar,

doi = "10.1002/int.22785",

language = "English",

volume = "37",

pages = "2567--2583",

journal = "International Journal of Intelligent Systems",

issn = "0884-8173",

publisher = "John Wiley and Sons Inc.",

number = "3",

}

TY - JOUR

T1 - A multitarget backdooring attack on deep neural networks with random location trigger

AU - Xiao, Yu

AU - Cong, Liu

AU - Mingwen, Zheng

AU - Yajie, Wang

AU - Xinrui, Liu

AU - Shuxiao, Song

AU - Yuexuan, Ma

AU - Jun, Zheng

PY - 2022/3

Y1 - 2022/3

N2 - Machine learning has made tremendous progress and applied to various critical practical applications. However, recent studies have shown that machine learning models are vulnerable to malicious attackers, such as neural network backdoor triggering. A successful backdoor triggering behavior may cause serious consequences, such as allowing the attacker to bypass the identity verification and directly enter the system. In image classification, there is always only one target label triggered by one backdoor trigger in previous works. The position of the backdoor trigger is also fixed, which brings limitations to the attack. In this paper, we propose a novel method that utilizes one trigger pattern to correspond to multiple target labels, and the location of the trigger is not limited. In our method, the trigger guarantees that the malicious output is within the range of multiple targets chosen by the attacker, but the specific target depends on the original image where the trigger is pasted. Due to the original images' diversity, it is difficult for the defender to predict which target the image with the trigger is classified as. Besides, the attacker can use only one trigger pattern to achieve multitarget attacks at different locations, which brings more flexibility. We also proposed to train a neural network as a detector to distinguish backdoor images and clean images for multitarget backdooring attacks. Experiment results show that the detection method can also successfully detect the backdoor image with a trigger at a random location of the image, and the detection success rate is as high as 86.02%.

AB - Machine learning has made tremendous progress and applied to various critical practical applications. However, recent studies have shown that machine learning models are vulnerable to malicious attackers, such as neural network backdoor triggering. A successful backdoor triggering behavior may cause serious consequences, such as allowing the attacker to bypass the identity verification and directly enter the system. In image classification, there is always only one target label triggered by one backdoor trigger in previous works. The position of the backdoor trigger is also fixed, which brings limitations to the attack. In this paper, we propose a novel method that utilizes one trigger pattern to correspond to multiple target labels, and the location of the trigger is not limited. In our method, the trigger guarantees that the malicious output is within the range of multiple targets chosen by the attacker, but the specific target depends on the original image where the trigger is pasted. Due to the original images' diversity, it is difficult for the defender to predict which target the image with the trigger is classified as. Besides, the attacker can use only one trigger pattern to achieve multitarget attacks at different locations, which brings more flexibility. We also proposed to train a neural network as a detector to distinguish backdoor images and clean images for multitarget backdooring attacks. Experiment results show that the detection method can also successfully detect the backdoor image with a trigger at a random location of the image, and the detection success rate is as high as 86.02%.

UR - http://www.scopus.com/inward/record.url?scp=85122077333&partnerID=8YFLogxK

U2 - 10.1002/int.22785

DO - 10.1002/int.22785

M3 - Article

AN - SCOPUS:85122077333

SN - 0884-8173

VL - 37

SP - 2567

EP - 2583

JO - International Journal of Intelligent Systems

JF - International Journal of Intelligent Systems

IS - 3

ER -

A multitarget backdooring attack on deep neural networks with random location trigger

摘要

访问文件

其它文件与链接

指纹

引用此