Abstract
In the target assignment for multi-missile cooperative operations, there exists uncertainty in the number and variety of enemy platforms and anti-ship missiles, which makes it difficult to model the target assignment algorithm. To improve the effectiveness of attacks under high-dynamic collaborative attack conditions, a dynamic battlefield environment model and a single-round Markov decision model for multi-target assignment were established. An improved deep deterministic policy gradient (DDPG) assignment algorithm was proposed to automatically find the optimal allocation strategy through interaction with the simulator. The algorithm uses the mask method to mask the action space and adapt to the number and type of platforms. The simulation results show that under different defense configurations and configurations of red and blue sides, the performance improvement of the attack strategy obtained by the algorithm was about 87.5% compared with that of the random strategy, and the reasoning time of the model was about 0.04 ms. This research will accelerate the application of DDPG-based methods in intelligent decision-making in high-dynamic environments, and promote the research on cluster autonomous decision-making methods.
Translated title of the contribution | Research on Multi-Target Assignment Method for Clusters Based on Deep Deterministic Policy Gradient Learning |
---|---|
Original language | Chinese (Traditional) |
Pages (from-to) | 1051-1057 |
Number of pages | 7 |
Journal | Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology |
Volume | 44 |
Issue number | 10 |
DOIs | |
Publication status | Published - Oct 2024 |