TY - JOUR
T1 - Intelligent cooperative attack-defense game of multiple UAVs with asymmetric maneuverability
AU - Chen, Can
AU - Mo, Li
AU - Zheng, Duo
AU - Cheng, Ziheng
AU - Lin, Defu
N1 - Publisher Copyright:
© 2020, Beihang University Aerospace Knowledge Press. All rights reserved.
PY - 2020/12/25
Y1 - 2020/12/25
N2 - The attack-defense game is an important combat scenario for future military Unmanned Aerial Vehicles (UAVs). This paper studies an attack-defense game between groups of UAVs with different maneuverability and establishes a multi-UAV cooperative attack-defense evolution model. Based on multi-agent reinforcement learning theory, an autonomous decision-making method for the multi-UAV cooperative attack-defense game is developed, and a centralized-critic, distributed-actor structure is proposed based on the actor-critic algorithm, guaranteeing convergence and improving decision-making efficiency. During training, each UAV's critic module uses global information to evaluate decision quality, while during execution the actor module relies only on local perception information to make autonomous decisions, thereby improving the effectiveness of the multi-UAV attack-defense game. Simulation results show that the proposed multi-UAV reinforcement learning method has a strong self-evolution property, endowing the UAVs with a certain degree of intelligence, namely a stable autonomous learning ability. Through continuous training, the UAVs autonomously learn cooperative attack or defense policies that improve decision-making effectiveness.
AB - The attack-defense game is an important combat scenario for future military Unmanned Aerial Vehicles (UAVs). This paper studies an attack-defense game between groups of UAVs with different maneuverability and establishes a multi-UAV cooperative attack-defense evolution model. Based on multi-agent reinforcement learning theory, an autonomous decision-making method for the multi-UAV cooperative attack-defense game is developed, and a centralized-critic, distributed-actor structure is proposed based on the actor-critic algorithm, guaranteeing convergence and improving decision-making efficiency. During training, each UAV's critic module uses global information to evaluate decision quality, while during execution the actor module relies only on local perception information to make autonomous decisions, thereby improving the effectiveness of the multi-UAV attack-defense game. Simulation results show that the proposed multi-UAV reinforcement learning method has a strong self-evolution property, endowing the UAVs with a certain degree of intelligence, namely a stable autonomous learning ability. Through continuous training, the UAVs autonomously learn cooperative attack or defense policies that improve decision-making effectiveness.
KW - Attack-defense games
KW - Centralized critic
KW - Distributed actors
KW - Multi-UAV coordination
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85098990144&partnerID=8YFLogxK
U2 - 10.7527/S1000-6893.2020.24152
DO - 10.7527/S1000-6893.2020.24152
M3 - Article
AN - SCOPUS:85098990144
SN - 1000-6893
VL - 41
JO - Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
JF - Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
IS - 12
M1 - 324152
ER -