Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem

Zhixin Zhao; Jie Chen; Bin Xin; Li Li; Keming Jiao; Yifan Zheng

doi:10.1007/s11424-024-4029-8

Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem

Zhixin Zhao, Jie Chen, Bin Xin^*, Li Li, Keming Jiao, Yifan Zheng

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, centralized decision-making architecture is beneficial for conflict resolution while it is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for sensor target coverage problem, a scalable centralized assignment method based on self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of proximal policy optimization algorithm (PPO) is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improve the success rate of defense.

Original language	English
Pages (from-to)	369-388
Number of pages	20
Journal	Journal of Systems Science and Complexity
Volume	37
Issue number	1
DOIs	https://doi.org/10.1007/s11424-024-4029-8
Publication status	Published - Feb 2024

Keywords

Conflict resolution
reinforcement learning
scalability
task assignment

Access to Document

10.1007/s11424-024-4029-8

Cite this

Zhao, Z., Chen, J., Xin, B., Li, L., Jiao, K., & Zheng, Y. (2024). Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem. Journal of Systems Science and Complexity, 37(1), 369-388. https://doi.org/10.1007/s11424-024-4029-8

@article{da088b26b0434dfbaff0fe6726e55c6d,

title = "Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem",

abstract = "The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, centralized decision-making architecture is beneficial for conflict resolution while it is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for sensor target coverage problem, a scalable centralized assignment method based on self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of proximal policy optimization algorithm (PPO) is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improve the success rate of defense.",

keywords = "Conflict resolution, reinforcement learning, scalability, task assignment",

author = "Zhixin Zhao and Jie Chen and Bin Xin and Li Li and Keming Jiao and Yifan Zheng",

note = "Publisher Copyright: {\textcopyright} The Editorial Office of JSSC & Springer-Verlag GmbH Germany 2024.",

year = "2024",

month = feb,

doi = "10.1007/s11424-024-4029-8",

language = "English",

volume = "37",

pages = "369--388",

journal = "Journal of Systems Science and Complexity",

issn = "1009-6124",

publisher = "Springer New York",

number = "1",

}

TY - JOUR

T1 - Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem

AU - Zhao, Zhixin

AU - Chen, Jie

AU - Xin, Bin

AU - Li, Li

AU - Jiao, Keming

AU - Zheng, Yifan

N1 - Publisher Copyright: © The Editorial Office of JSSC & Springer-Verlag GmbH Germany 2024.

PY - 2024/2

Y1 - 2024/2

N2 - The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, centralized decision-making architecture is beneficial for conflict resolution while it is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for sensor target coverage problem, a scalable centralized assignment method based on self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of proximal policy optimization algorithm (PPO) is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improve the success rate of defense.

AB - The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, centralized decision-making architecture is beneficial for conflict resolution while it is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for sensor target coverage problem, a scalable centralized assignment method based on self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of proximal policy optimization algorithm (PPO) is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improve the success rate of defense.

KW - Conflict resolution

KW - reinforcement learning

KW - scalability

KW - task assignment

UR - http://www.scopus.com/inward/record.url?scp=85186120382&partnerID=8YFLogxK

U2 - 10.1007/s11424-024-4029-8

DO - 10.1007/s11424-024-4029-8

M3 - Article

AN - SCOPUS:85186120382

SN - 1009-6124

VL - 37

SP - 369

EP - 388

JO - Journal of Systems Science and Complexity

JF - Journal of Systems Science and Complexity

IS - 1

ER -

Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this