Reentry Trajectory Planning Based on Proximal Policy Optimization

Xinyu Shi*, Honbin Deng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In order to deal with the changing flight environment in the process of hypersonic vehicle reentry, a real-time trajectory planning algorithm based on Proximal Policy Optimization (PPO) was proposed. The proximal policy optimization is used to train the hypersonic vehicle reentry process, and the intelligent vehicle that can obtain the best trajectory control output according to the state input is obtained. Continuous roll Angle, discrete roll Angle, and continuous roll Angle change rate are selected as actions to study their training effects on the reentry process. The results show that the action space based on the change rate of the roll Angle converges faster and the reentry flight time is shorter. The trajectory planning method based on proximal strategy optimization can quickly generate the optimal trajectory of high speed aircraft. Compared with the trajectory planning algorithm based on pseudospectral method, the proposed method has the generalization ability to meet the accuracy requirements and can meet the needs of online real-time trajectory planning.

Original languageEnglish
Title of host publicationProceedings of 3rd 2023 International Conference on Autonomous Unmanned Systems (3rd ICAUS 2023) - Volume I
EditorsYi Qu, Mancang Gu, Yifeng Niu, Wenxing Fu
PublisherSpringer Science and Business Media Deutschland GmbH
Pages144-153
Number of pages10
ISBN (Print)9789819711062
DOIs
Publication statusPublished - 2024
Event3rd International Conference on Autonomous Unmanned Systems, ICAUS 2023 - Nanjing, China
Duration: 9 Sept 202311 Sept 2023

Publication series

NameLecture Notes in Electrical Engineering
Volume1170
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference3rd International Conference on Autonomous Unmanned Systems, ICAUS 2023
Country/TerritoryChina
CityNanjing
Period9/09/2311/09/23

Keywords

  • deep reinforcement learning
  • hypersonic vehicle
  • proximal policy optimization
  • Reentry trajectory planning

Cite this