摘要
In the target assignment for multi-missile cooperative operations, there exists uncertainty in the number and variety of enemy platforms and anti-ship missiles, which makes it difficult to model the target assignment algorithm. To improve the effectiveness of attacks under high-dynamic collaborative attack conditions, a dynamic battlefield environment model and a single-round Markov decision model for multi-target assignment were established. An improved deep deterministic policy gradient (DDPG) assignment algorithm was proposed to automatically find the optimal allocation strategy through interaction with the simulator. The algorithm uses the mask method to mask the action space and adapt to the number and type of platforms. The simulation results show that under different defense configurations and configurations of red and blue sides, the performance improvement of the attack strategy obtained by the algorithm was about 87.5% compared with that of the random strategy, and the reasoning time of the model was about 0.04 ms. This research will accelerate the application of DDPG-based methods in intelligent decision-making in high-dynamic environments, and promote the research on cluster autonomous decision-making methods.
投稿的翻译标题 | Research on Multi-Target Assignment Method for Clusters Based on Deep Deterministic Policy Gradient Learning |
---|---|
源语言 | 繁体中文 |
页(从-至) | 1051-1057 |
页数 | 7 |
期刊 | Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology |
卷 | 44 |
期 | 10 |
DOI | |
出版状态 | 已出版 - 10月 2024 |
关键词
- deep deterministic policy gradient (DDPG)
- dynamic environment
- Markov decision model
- multi-missile cooperation
- target assignment