Action-Manipulation Attack and Defense to X-Armed Bandits

Zhi Luo; Youqi Li; Lixing Chen; Zichuan Xu; Pan Zhou

doi:10.1109/TrustCom56396.2022.00153

Zhi Luo, Youqi Li, Lixing Chen, Zichuan Xu, Pan Zhou^*

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

As a continuous variant of multi-armed bandits (MAB), X-armed bandits have enriched many applications of online machine learning, such as personalized recommendation systems. However, attacks on and defenses for X-armed bandits remain largely unexplored, even though MAB has been shown to be vulnerable. In this paper, we aim to bridge this gap and investigate the robustness of X-armed bandits. Specifically, we consider the action-manipulation attack, which is practical but harder than the existing reward-manipulation attack. We propose an attack algorithm based on a lower bound tree (LBT), which can continuously hijack the learner's action by perturbing the construction of the high confidence tree (HCT) used by X-armed bandits. As a result, the nodes containing the arm targeted by the attacker are selected frequently with a sublinear attack cost. To defend against the LBT attack, we propose a robust version of the HCT algorithm, called RoHCT. We show theoretically that the regret of RoHCT is related to the upper bound of the total cost Q and remains sublinear in the total number of rounds T. We carry out experiments to evaluate the effectiveness of LBT and RoHCT.

Original language	English
Title of host publication	Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1115-1122
Number of pages	8
ISBN (Electronic)	9781665494250
DOIs	https://doi.org/10.1109/TrustCom56396.2022.00153
Publication status	Published - 2022
Event	21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022 - Virtual, Online, China
Duration	9 Dec 2022 → 11 Dec 2022

Publication series

Name	Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022

Conference

Conference	21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022
Country/Territory	China
City	Virtual, Online
Period	9/12/22 → 11/12/22

Keywords

action-manipulation attack
defense
robustness
χ-armed bandits

Access to Document

10.1109/TrustCom56396.2022.00153

Cite this

Luo, Z., Li, Y., Chen, L., Xu, Z., & Zhou, P. (2022). Action-Manipulation Attack and Defense to X-Armed Bandits. In Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022 (pp. 1115-1122). (Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/TrustCom56396.2022.00153

Luo, Zhi ; Li, Youqi ; Chen, Lixing et al. / Action-Manipulation Attack and Defense to X-Armed Bandits. Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022. Institute of Electrical and Electronics Engineers Inc., 2022. pp. 1115-1122 (Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022).

@inproceedings{23b968887fab42d4b4a51a577c8e52bc,

title = "Action-Manipulation Attack and Defense to X-Armed Bandits",

abstract = "As a continuous variant of multi-armed bandits (MAB), X-armed bandits have enriched many applications of online machine learning, such as personalized recommendation systems. However, attacks on and defenses for X-armed bandits remain largely unexplored, even though MAB has been shown to be vulnerable. In this paper, we aim to bridge this gap and investigate the robustness of X-armed bandits. Specifically, we consider the action-manipulation attack, which is practical but harder than the existing reward-manipulation attack. We propose an attack algorithm based on a lower bound tree (LBT), which can continuously hijack the learner's action by perturbing the construction of the high confidence tree (HCT) used by X-armed bandits. As a result, the nodes containing the arm targeted by the attacker are selected frequently with a sublinear attack cost. To defend against the LBT attack, we propose a robust version of the HCT algorithm, called RoHCT. We show theoretically that the regret of RoHCT is related to the upper bound of the total cost Q and remains sublinear in the total number of rounds T. We carry out experiments to evaluate the effectiveness of LBT and RoHCT.",

keywords = "action-manipulation attack, defense, robustness, χ-armed bandits",

author = "Zhi Luo and Youqi Li and Lixing Chen and Zichuan Xu and Pan Zhou",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022 ; Conference date: 09-12-2022 Through 11-12-2022",

year = "2022",

doi = "10.1109/TrustCom56396.2022.00153",

language = "English",

series = "Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1115--1122",

booktitle = "Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022",

address = "United States",

}

Luo, Z, Li, Y, Chen, L, Xu, Z & Zhou, P 2022, Action-Manipulation Attack and Defense to X-Armed Bandits. in Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022. Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022, Institute of Electrical and Electronics Engineers Inc., pp. 1115-1122, 21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022, Virtual, Online, China, 9/12/22. https://doi.org/10.1109/TrustCom56396.2022.00153

Action-Manipulation Attack and Defense to X-Armed Bandits. / Luo, Zhi; Li, Youqi; Chen, Lixing et al.
Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022. Institute of Electrical and Electronics Engineers Inc., 2022. p. 1115-1122 (Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022).

TY - GEN

T1 - Action-Manipulation Attack and Defense to X-Armed Bandits

AU - Luo, Zhi

AU - Li, Youqi

AU - Chen, Lixing

AU - Xu, Zichuan

AU - Zhou, Pan

PY - 2022

Y1 - 2022

N2 - As a continuous variant of multi-armed bandits (MAB), X-armed bandits have enriched many applications of online machine learning, such as personalized recommendation systems. However, attacks on and defenses for X-armed bandits remain largely unexplored, even though MAB has been shown to be vulnerable. In this paper, we aim to bridge this gap and investigate the robustness of X-armed bandits. Specifically, we consider the action-manipulation attack, which is practical but harder than the existing reward-manipulation attack. We propose an attack algorithm based on a lower bound tree (LBT), which can continuously hijack the learner's action by perturbing the construction of the high confidence tree (HCT) used by X-armed bandits. As a result, the nodes containing the arm targeted by the attacker are selected frequently with a sublinear attack cost. To defend against the LBT attack, we propose a robust version of the HCT algorithm, called RoHCT. We show theoretically that the regret of RoHCT is related to the upper bound of the total cost Q and remains sublinear in the total number of rounds T. We carry out experiments to evaluate the effectiveness of LBT and RoHCT.

AB - As a continuous variant of multi-armed bandits (MAB), X-armed bandits have enriched many applications of online machine learning, such as personalized recommendation systems. However, attacks on and defenses for X-armed bandits remain largely unexplored, even though MAB has been shown to be vulnerable. In this paper, we aim to bridge this gap and investigate the robustness of X-armed bandits. Specifically, we consider the action-manipulation attack, which is practical but harder than the existing reward-manipulation attack. We propose an attack algorithm based on a lower bound tree (LBT), which can continuously hijack the learner's action by perturbing the construction of the high confidence tree (HCT) used by X-armed bandits. As a result, the nodes containing the arm targeted by the attacker are selected frequently with a sublinear attack cost. To defend against the LBT attack, we propose a robust version of the HCT algorithm, called RoHCT. We show theoretically that the regret of RoHCT is related to the upper bound of the total cost Q and remains sublinear in the total number of rounds T. We carry out experiments to evaluate the effectiveness of LBT and RoHCT.

KW - action-manipulation attack

KW - defense

KW - robustness

KW - χ-armed bandits

UR - http://www.scopus.com/inward/record.url?scp=85151747359&partnerID=8YFLogxK

U2 - 10.1109/TrustCom56396.2022.00153

DO - 10.1109/TrustCom56396.2022.00153

M3 - Conference contribution

AN - SCOPUS:85151747359

T3 - Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022

SP - 1115

EP - 1122

BT - Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 21st IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022

Y2 - 9 December 2022 through 11 December 2022

ER -

Luo Z, Li Y, Chen L, Xu Z, Zhou P. Action-Manipulation Attack and Defense to X-Armed Bandits. In Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022. Institute of Electrical and Electronics Engineers Inc. 2022. p. 1115-1122. (Proceedings - 2022 IEEE 21st International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2022). doi: 10.1109/TrustCom56396.2022.00153
