Robust Low-Thrust Trajectory Design for Interplanetary Spaceflight: An Adaptive Latent Reinforcement Learning Method

Research output: Contribution to journalArticlepeer-review

Abstract

This article investigates the problem of robust trajectory design for low-thrust spacecraft subject to state and observation uncertainties. An adaptive latent reinforcement learning (RL) scheme based on sequential latent variable models (SLVMs) is proposed to address this issue. First, an SLVM is employed for the representation learning of uncertain environments and for predicting future observations. Subsequently, by integrating representation learning based on the SLVM with proximal policy optimization (PPO), a stochastic latent PPO (SLPPO) scheme is introduced. Distinct from existing methods, the control policy is derived from learned stochastic latent variables rather than raw uncertain observations, which effectively mitigates the adverse impact of uncertainties on control performance. Furthermore, to enhance training efficiency, an improved dense reward shaping mechanism is designed based on the observation predictions from the SLVM and adaptive techniques. Finally, numerical simulations of two rendezvous missions validate the effectiveness of the proposed approach.

Original languageEnglish
JournalIEEE Transactions on Cybernetics
DOIs
Publication statusAccepted/In press - 2026
Externally publishedYes

Keywords

  • Latent variable model
  • reinforcement learning (RL)
  • reward function
  • robust trajectory optimization

Fingerprint

Dive into the research topics of 'Robust Low-Thrust Trajectory Design for Interplanetary Spaceflight: An Adaptive Latent Reinforcement Learning Method'. Together they form a unique fingerprint.

Cite this