Backdoor Attacks on Image Classification Models in Deep Neural Networks

Quanxin Zhang; Wencong Ma; Yajie Wang; Yaoyuan Zhang; Zhiwei Shi; Yuanzhang Li

doi:10.1049/cje.2021.00.126

Quanxin Zhang^*, Wencong Ma, Yajie Wang^*, Yaoyuan Zhang, Zhiwei Shi, Yuanzhang Li

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Review article › peer-review

20 Citations (Scopus)

Abstract

Deep neural networks (DNNs) are applied widely and achieve state-of-the-art performance in many applications. However, the structure of a DNN lacks transparency and interpretability for users. Attackers can exploit this property to embed trojan horses in the DNN structure, for example by inserting a backdoor so that the DNN learns an additional malicious task alongside its normal main task. DNNs also rely on data sets for training. Attackers can tamper with the training data to interfere with the training process, for example by attaching a trigger to input data. Because of these weaknesses in DNN structure and data, backdoor attacks can pose a serious threat to the security of DNNs. A backdoored DNN performs well on benign inputs but outputs an attacker-specified label on inputs that carry the trigger. Backdoor attacks can be conducted at almost every stage of the machine learning pipeline. Although there is some research on backdoor attacks against image classification, systematic reviews of the field are still rare. This paper is a comprehensive review of backdoor attacks. According to whether attackers have access to the training data, we divide backdoor attacks into two types: poisoning-based attacks and non-poisoning-based attacks. We go through the details of each work in chronological order, discussing its contributions and deficiencies. We propose a detailed mathematical backdoor model to summarize all kinds of backdoor attacks. Finally, we provide some insights for future studies.
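To make the poisoning-based case concrete, the following is a minimal sketch of how a trigger might be attached to a fraction of a training set and the affected samples relabeled. It is an illustration only, not the paper's mathematical backdoor model: the function names `attach_trigger` and `poison_dataset`, the fixed corner-patch trigger, and the `rate` and `patch_size` parameters are all assumptions introduced here, and images are represented as plain nested lists of pixel values.

```python
import random


def attach_trigger(image, patch_size=2, value=1.0):
    """Return a copy of `image` with a square patch set to `value`
    in the bottom-right corner (a hypothetical, fixed trigger)."""
    stamped = [row[:] for row in image]
    height, width = len(stamped), len(stamped[0])
    for r in range(height - patch_size, height):
        for c in range(width - patch_size, width):
            stamped[r][c] = value
    return stamped


def poison_dataset(images, labels, target_label, rate=0.1, seed=0):
    """Stamp the trigger on a random `rate` fraction of samples and
    relabel them to `target_label`. Inputs are left unmodified.

    Returns (new_images, new_labels, poisoned_indices).
    """
    rng = random.Random(seed)
    n_poison = int(len(images) * rate)
    chosen = set(rng.sample(range(len(images)), n_poison))
    new_images, new_labels = [], []
    for i, (img, lab) in enumerate(zip(images, labels)):
        if i in chosen:
            # Poisoned sample: trigger attached, attacker-specified label.
            new_images.append(attach_trigger(img))
            new_labels.append(target_label)
        else:
            # Benign sample: copied through unchanged.
            new_images.append([row[:] for row in img])
            new_labels.append(lab)
    return new_images, new_labels, sorted(chosen)
```

A model trained on such a set would, if the attack succeeds, behave normally on benign inputs and predict `target_label` on inputs carrying the patch. The sketch covers only the data-manipulation step; training and evaluation are out of scope here.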

Original language	English
Pages (from-to)	199-212
Number of pages	14
Journal	Chinese Journal of Electronics
Volume	31
Issue number	2
DOIs	https://doi.org/10.1049/cje.2021.00.126
Publication status	Published - Mar 2022

Keywords

Backdoor attack
Non-poisoning-based attacks
Poisoning-based attacks
Review
Security

Cite this

@article{9ea751069b3142cfbc0e0119238aa1d7,

title = "Backdoor Attacks on Image Classification Models in Deep Neural Networks",

keywords = "Backdoor attack, Non-poisoning-based attacks, Poisoning-based attacks, Review, Security",

author = "Quanxin Zhang and Wencong Ma and Yajie Wang and Yaoyuan Zhang and Zhiwei Shi and Yuanzhang Li",

note = "Publisher Copyright: {\textcopyright} 2022 Chinese Institute of Electronics",

year = "2022",

month = mar,

doi = "10.1049/cje.2021.00.126",

language = "English",

volume = "31",

pages = "199--212",

journal = "Chinese Journal of Electronics",

issn = "1022-4653",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "2",

}

TY - JOUR

T1 - Backdoor Attacks on Image Classification Models in Deep Neural Networks

AU - Zhang, Quanxin

AU - Ma, Wencong

AU - Wang, Yajie

AU - Zhang, Yaoyuan

AU - Shi, Zhiwei

AU - Li, Yuanzhang

PY - 2022/3

Y1 - 2022/3

KW - Backdoor attack

KW - Non-poisoning-based attacks

KW - Poisoning-based attacks

KW - Review

KW - Security

UR - http://www.scopus.com/inward/record.url?scp=85126050609&partnerID=8YFLogxK

U2 - 10.1049/cje.2021.00.126

DO - 10.1049/cje.2021.00.126

M3 - Review article

AN - SCOPUS:85126050609

SN - 1022-4653

VL - 31

SP - 199

EP - 212

JO - Chinese Journal of Electronics

JF - Chinese Journal of Electronics

IS - 2

ER -
