基于深度确定性梯度学习的集群多目标分配方法

Qiaoyi Li, Zhengjie Wang*, Xiaoning Zhang, Qiyuan Cheng

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

In the target assignment for multi-missile cooperative operations, there exists uncertainty in the number and variety of enemy platforms and anti-ship missiles, which makes it difficult to model the target assignment algorithm. To improve the effectiveness of attacks under high-dynamic collaborative attack conditions, a dynamic battlefield environment model and a single-round Markov decision model for multi-target assignment were established. An improved deep deterministic policy gradient (DDPG) assignment algorithm was proposed to automatically find the optimal allocation strategy through interaction with the simulator. The algorithm uses the mask method to mask the action space and adapt to the number and type of platforms. The simulation results show that under different defense configurations and configurations of red and blue sides, the performance improvement of the attack strategy obtained by the algorithm was about 87.5% compared with that of the random strategy, and the reasoning time of the model was about 0.04 ms. This research will accelerate the application of DDPG-based methods in intelligent decision-making in high-dynamic environments, and promote the research on cluster autonomous decision-making methods.

投稿的翻译标题Research on Multi-Target Assignment Method for Clusters Based on Deep Deterministic Policy Gradient Learning
源语言繁体中文
页(从-至)1051-1057
页数7
期刊Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
44
10
DOI
出版状态已出版 - 10月 2024

关键词

  • deep deterministic policy gradient (DDPG)
  • dynamic environment
  • Markov decision model
  • multi-missile cooperation
  • target assignment

指纹

探究 '基于深度确定性梯度学习的集群多目标分配方法' 的科研主题。它们共同构成独一无二的指纹。

引用此