Enhanced Multi-Agent Proximal Policy Optimization for Multi-UAV Target Offensive-Defensive Decision

Yifan Zheng, Bin Xin*, Keming Jiao, Zhixin Zhao, Yuyang Wang, Yunming Zhao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Autonomous collaborative decision-making is the key technology to achieve large-scale unmanned combat. Focus on the problem of multiple unmanned aerial vehicles' cooperative decision in target offensive and defensive combat, a multi-agent deep reinforcement learning (MADRL) based decision framework is proposed in this paper. Firstly, the simulation environment with a high-fidelity fixed-wing motion model is built. Secondly, to address the issue of high-dimension state space and credit assignment under a multi-agent environment, an enhanced multi-agent proximal policy optimization with mean-field counterfactual advantage (MAPPO - MFCOA) is proposed. Finally, the results of simulation experiments will verify the performance of the proposed approach.

Original languageEnglish
Title of host publication2023 42nd Chinese Control Conference, CCC 2023
PublisherIEEE Computer Society
Pages5986-5991
Number of pages6
ISBN (Electronic)9789887581543
DOIs
Publication statusPublished - 2023
Event42nd Chinese Control Conference, CCC 2023 - Tianjin, China
Duration: 24 Jul 202326 Jul 2023

Publication series

NameChinese Control Conference, CCC
Volume2023-July
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference42nd Chinese Control Conference, CCC 2023
Country/TerritoryChina
CityTianjin
Period24/07/2326/07/23

Keywords

  • deep reinforcement learning
  • multi-UAV combat
  • multi-agent system
  • proximal policy optimization

Fingerprint

Dive into the research topics of 'Enhanced Multi-Agent Proximal Policy Optimization for Multi-UAV Target Offensive-Defensive Decision'. Together they form a unique fingerprint.

Cite this