TY - GEN
T1 - Enhanced Multi-Agent Proximal Policy Optimization for Multi-UAV Target Offensive-Defensive Decision
AU - Zheng, Yifan
AU - Xin, Bin
AU - Jiao, Keming
AU - Zhao, Zhixin
AU - Wang, Yuyang
AU - Zhao, Yunming
N1 - Publisher Copyright:
© 2023 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2023
Y1 - 2023
N2 - Autonomous collaborative decision-making is the key technology to achieve large-scale unmanned combat. Focus on the problem of multiple unmanned aerial vehicles' cooperative decision in target offensive and defensive combat, a multi-agent deep reinforcement learning (MADRL) based decision framework is proposed in this paper. Firstly, the simulation environment with a high-fidelity fixed-wing motion model is built. Secondly, to address the issue of high-dimension state space and credit assignment under a multi-agent environment, an enhanced multi-agent proximal policy optimization with mean-field counterfactual advantage (MAPPO - MFCOA) is proposed. Finally, the results of simulation experiments will verify the performance of the proposed approach.
AB - Autonomous collaborative decision-making is the key technology to achieve large-scale unmanned combat. Focus on the problem of multiple unmanned aerial vehicles' cooperative decision in target offensive and defensive combat, a multi-agent deep reinforcement learning (MADRL) based decision framework is proposed in this paper. Firstly, the simulation environment with a high-fidelity fixed-wing motion model is built. Secondly, to address the issue of high-dimension state space and credit assignment under a multi-agent environment, an enhanced multi-agent proximal policy optimization with mean-field counterfactual advantage (MAPPO - MFCOA) is proposed. Finally, the results of simulation experiments will verify the performance of the proposed approach.
KW - deep reinforcement learning
KW - multi-UAV combat
KW - multi-agent system
KW - proximal policy optimization
UR - http://www.scopus.com/inward/record.url?scp=85175533585&partnerID=8YFLogxK
U2 - 10.23919/CCC58697.2023.10240070
DO - 10.23919/CCC58697.2023.10240070
M3 - Conference contribution
AN - SCOPUS:85175533585
T3 - Chinese Control Conference, CCC
SP - 5986
EP - 5991
BT - 2023 42nd Chinese Control Conference, CCC 2023
PB - IEEE Computer Society
T2 - 42nd Chinese Control Conference, CCC 2023
Y2 - 24 July 2023 through 26 July 2023
ER -