Abstract
For the cooperative guidance issue of high-hypersonic re-entry gliding vehicles to simultaneously hit a target at a specified angle in a complex environment,a cooperative guidance law based on meta-learning and reinforcement learning algorithms is proposed. Considering the interference caused by complex combat environment,a Markov decision model for the cooperative guidance issue is established, taking the gliding vehicles’ motion status and proportional guidance factor as the state space and action space. A reward function is designed by comprehensively considering the vehicle-target distance, remaining flight time difference, and overload situation for multiple gliding vehicles attacking a target. Based on meta-learning theory and reinforcement learning algorithm, the proximal policy optimization algorithms are combined with the gated recurrent units to learn the common features of similar cooperative guidance tasks. This approach enhances the accuracy of cooperative guidance strategies in complex interference environments to achieve the constraints on angle of attack and attack time,while also improving the adaptability of cooperative guidance strategy to different combat scenarios. Simulated results indicate that the proposed cooperative guidance law enables multiple aerial vehicles to simultaneously attack a target at a specified attack angle in complex battlefield environment and quickly adapt to new cooperative guidance tasks. The cooperative guidance law maintains good performance even when the cooperative combat scenario changes.
| Translated title of the contribution | A Cooperative Guidance Law Based on Meta-learning and Reinforcement Learning for Multiple Aerial Vehicles |
|---|---|
| Original language | Chinese (Traditional) |
| Article number | 240568 |
| Journal | Binggong Xuebao/Acta Armamentarii |
| Volume | 46 |
| Issue number | 7 |
| DOIs | |
| Publication status | Published - 31 Jul 2025 |
| Externally published | Yes |
Fingerprint
Dive into the research topics of 'A Cooperative Guidance Law Based on Meta-learning and Reinforcement Learning for Multiple Aerial Vehicles'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver