TY - GEN
T1 - Optimization of Multi-Platform Dynamic Weapon-Target Assignment Based on Multi-Agent Reinforcement Learning
AU - Wang, Haoran
AU - Wang, Qing
AU - Xin, Bin
AU - Wang, Yujue
N1 - Publisher Copyright:
© 2025 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2025
Y1 - 2025
N2 - This paper focuses on the 'Weapon Platform-Weapon-Target Assignment' (W-WTA) Problem, aiming to minimize the expected threat values of incoming targets during combat while considering the minimal consumption of resources for effective resource allocation. To solve this problem, we propose a multi-agent reinforcement learning method based on the attention mechanism. We have established threat value model for targets and damage probability model. Each weapon platform is characterized by distinct attributes, such as range, weapon type and the number of missiles, etc. The multi-head attention mechanism integrates target information and weapon platform information to adjust the output of each agent, enabling each agent to consider the overall combat effectiveness. Experimental results indicate the in the most cases, the proposed algorithm demonstrates superior performance with a lower time cost. Compared with the Genetic Algorithm (GA) and Greedy Algorithm, it shows outstanding effectiveness in solving the W-WTA problem. This research takes into account the actual combat process and provides an advanced and reliable method for military operation decision-making.
AB - This paper focuses on the 'Weapon Platform-Weapon-Target Assignment' (W-WTA) Problem, aiming to minimize the expected threat values of incoming targets during combat while considering the minimal consumption of resources for effective resource allocation. To solve this problem, we propose a multi-agent reinforcement learning method based on the attention mechanism. We have established threat value model for targets and damage probability model. Each weapon platform is characterized by distinct attributes, such as range, weapon type and the number of missiles, etc. The multi-head attention mechanism integrates target information and weapon platform information to adjust the output of each agent, enabling each agent to consider the overall combat effectiveness. Experimental results indicate the in the most cases, the proposed algorithm demonstrates superior performance with a lower time cost. Compared with the Genetic Algorithm (GA) and Greedy Algorithm, it shows outstanding effectiveness in solving the W-WTA problem. This research takes into account the actual combat process and provides an advanced and reliable method for military operation decision-making.
KW - Dynamic Weapon-Target Assignment
KW - Multi-Weapon Platform
KW - Reinforcement Learning
UR - https://www.scopus.com/pages/publications/105020286796
U2 - 10.23919/CCC64809.2025.11179489
DO - 10.23919/CCC64809.2025.11179489
M3 - Conference contribution
AN - SCOPUS:105020286796
T3 - Chinese Control Conference, CCC
SP - 2322
EP - 2327
BT - Proceedings of the 44th Chinese Control Conference, CCC 2025
A2 - Sun, Jian
A2 - Yin, Hongpeng
PB - IEEE Computer Society
T2 - 44th Chinese Control Conference, CCC 2025
Y2 - 28 July 2025 through 30 July 2025
ER -