Abstract
To address the limitations of fixed-cycle strategies in uncertain orbital pursuit-evasion games, this study proposes a hierarchical network reinforcement learning approach for variable-cycle decision-making. Firstly, pulse-thrust-controlled spacecraft dynamics and a variable-cycle game environment are constructed, followed by the design of a dual-network architecture to generate actions across heterogeneous decision cycles. Finally, the strategy is trained in simulation, enabling strategy convergence, with cumulative reward trajectories validating the effectiveness of the method.
| Original language | English |
|---|---|
| Pages (from-to) | 2024-2029 |
| Number of pages | 6 |
| Journal | IFAC-PapersOnLine |
| Volume | 59 |
| Issue number | 20 |
| DOIs | |
| Publication status | Published - 1 Aug 2025 |
| Externally published | Yes |
| Event | 23th IFAC Symposium on Automatic Control in Aerospace, ACA 2025 - Harbin, China Duration: 2 Aug 2025 → 6 Aug 2025 |
Keywords
- Control algorithms implementation
- Decision making and autonomy
- guidance and control
- navigation
- Spacecraft
- Spacecraft dynamics