Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem

Zhixin Zhao; Jie Chen; Bin Xin; Li Li; Keming Jiao; Yifan Zheng

doi:10.1007/s11424-024-4029-8

Zhixin Zhao, Jie Chen, Bin Xin^*, Li Li, Keming Jiao, Yifan Zheng

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume a decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, a centralized decision-making architecture is beneficial for conflict resolution but is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for the sensor target coverage problem, a scalable centralized assignment method based on a self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of the proximal policy optimization (PPO) algorithm is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improves the success rate of defense.
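To make the notion of conflict-free assignment concrete, the following is a minimal sketch of a generic greedy resolution step. It is not the IPCR mechanism from the paper, whose details are not given in this record. It assumes, purely for illustration, that a conflict means two defenders selecting the same attacker, and that a centralized policy outputs a score matrix; the function name, the `scores` layout, and the example values are all hypothetical.

```python
from typing import List, Optional


def greedy_conflict_free_assignment(scores: List[List[float]]) -> List[Optional[int]]:
    """Assign each defender at most one attacker, with no attacker shared.

    scores[d][a] is a hypothetical preference of defender d for attacker a
    (an assumption for this sketch, not a quantity defined in the abstract).
    Returns a list where entry d is the attacker index given to defender d,
    or None if no attacker was left for that defender.
    """
    num_defenders = len(scores)
    num_attackers = len(scores[0]) if scores else 0
    assignment: List[Optional[int]] = [None] * num_defenders
    taken = [False] * num_attackers

    # Rank all (defender, attacker) pairs from highest to lowest score.
    pairs = sorted(
        ((scores[d][a], d, a) for d in range(num_defenders) for a in range(num_attackers)),
        key=lambda t: -t[0],
    )

    # Accept a pair only if both the defender and the attacker are still free,
    # so a lower-scored claim on an already-taken attacker is skipped.
    for _, d, a in pairs:
        if assignment[d] is None and not taken[a]:
            assignment[d] = a
            taken[a] = True
    return assignment


# Both defenders prefer attacker 0; the higher score wins and the other
# defender falls back to its next-best free attacker.
example_scores = [
    [0.9, 0.1, 0.3],
    [0.8, 0.7, 0.2],
]
result = greedy_conflict_free_assignment(example_scores)
```

In this sketch, conflicts are resolved sequentially over a sorted list, which is simple but not parallel; the abstract states that the paper's mechanism is parallelized for training efficiency, which this example does not attempt to reproduce.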

Original language	English
Pages (from-to)	369-388
Number of pages	20
Journal	Journal of Systems Science and Complexity
Volume	37
Issue number	1
DOI	https://doi.org/10.1007/s11424-024-4029-8
Publication status	Published - Feb 2024

Cite this

@article{da088b26b0434dfbaff0fe6726e55c6d,

title = "Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem",

abstract = "The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume a decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, a centralized decision-making architecture is beneficial for conflict resolution but is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for the sensor target coverage problem, a scalable centralized assignment method based on a self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of the proximal policy optimization (PPO) algorithm is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improves the success rate of defense.",

keywords = "Conflict resolution, reinforcement learning, scalability, task assignment",

author = "Zhixin Zhao and Jie Chen and Bin Xin and Li Li and Keming Jiao and Yifan Zheng",

note = "Publisher Copyright: {\textcopyright} The Editorial Office of JSSC \& Springer-Verlag GmbH Germany 2024.",

year = "2024",

month = feb,

doi = "10.1007/s11424-024-4029-8",

language = "English",

volume = "37",

pages = "369--388",

journal = "Journal of Systems Science and Complexity",

issn = "1009-6124",

publisher = "Springer New York",

number = "1",

}

TY - JOUR

T1 - Learning Scalable Task Assignment with Imperative-Priori Conflict Resolution in Multi-UAV Adversarial Swarm Defense Problem

AU - Zhao, Zhixin

AU - Chen, Jie

AU - Xin, Bin

AU - Li, Li

AU - Jiao, Keming

AU - Zheng, Yifan

N1 - Publisher Copyright: © The Editorial Office of JSSC & Springer-Verlag GmbH Germany 2024.

PY - 2024/2

Y1 - 2024/2

N2 - The multi-UAV adversary swarm defense (MUASD) problem is to defend a static base against an adversary UAV swarm by a defensive UAV swarm. Decomposing the problem into task assignment and low-level interception strategies is a widely used approach. Learning-based approaches for task assignment are a promising direction. Existing studies on learning-based methods generally assume a decentralized decision-making architecture, which is not beneficial for conflict resolution. In contrast, a centralized decision-making architecture is beneficial for conflict resolution but is often detrimental to scalability. To achieve scalability and conflict resolution simultaneously, inspired by a self-attention-based task assignment method for the sensor target coverage problem, a scalable centralized assignment method based on a self-attention mechanism together with a defender-attacker pairwise observation preprocessing (DAP-SelfAtt) is proposed. Then, an imperative-priori conflict resolution (IPCR) mechanism is proposed to achieve conflict-free assignment. Further, the IPCR mechanism is parallelized to enable efficient training. To validate the algorithm, a variant of the proximal policy optimization (PPO) algorithm is employed for training in scenarios of various scales. The experimental results show that the proposed algorithm not only achieves conflict-free task assignment but also maintains scalability, and significantly improves the success rate of defense.

KW - Conflict resolution

KW - reinforcement learning

KW - scalability

KW - task assignment

UR - http://www.scopus.com/inward/record.url?scp=85186120382&partnerID=8YFLogxK

U2 - 10.1007/s11424-024-4029-8

DO - 10.1007/s11424-024-4029-8

M3 - Article

AN - SCOPUS:85186120382

SN - 1009-6124

VL - 37

SP - 369

EP - 388

JO - Journal of Systems Science and Complexity

JF - Journal of Systems Science and Complexity

IS - 1

ER -
