TY - GEN
T1 - Reentry Trajectory Planning Based on Proximal Policy Optimization
AU - Shi, Xinyu
AU - Deng, Honbin
N1 - Publisher Copyright:
© Beijing HIWING Scientific and Technological Information Institute 2024.
PY - 2024
Y1 - 2024
N2 - In order to deal with the changing flight environment in the process of hypersonic vehicle reentry, a real-time trajectory planning algorithm based on Proximal Policy Optimization (PPO) was proposed. The proximal policy optimization is used to train the hypersonic vehicle reentry process, and the intelligent vehicle that can obtain the best trajectory control output according to the state input is obtained. Continuous roll Angle, discrete roll Angle, and continuous roll Angle change rate are selected as actions to study their training effects on the reentry process. The results show that the action space based on the change rate of the roll Angle converges faster and the reentry flight time is shorter. The trajectory planning method based on proximal strategy optimization can quickly generate the optimal trajectory of high speed aircraft. Compared with the trajectory planning algorithm based on pseudospectral method, the proposed method has the generalization ability to meet the accuracy requirements and can meet the needs of online real-time trajectory planning.
AB - In order to deal with the changing flight environment in the process of hypersonic vehicle reentry, a real-time trajectory planning algorithm based on Proximal Policy Optimization (PPO) was proposed. The proximal policy optimization is used to train the hypersonic vehicle reentry process, and the intelligent vehicle that can obtain the best trajectory control output according to the state input is obtained. Continuous roll Angle, discrete roll Angle, and continuous roll Angle change rate are selected as actions to study their training effects on the reentry process. The results show that the action space based on the change rate of the roll Angle converges faster and the reentry flight time is shorter. The trajectory planning method based on proximal strategy optimization can quickly generate the optimal trajectory of high speed aircraft. Compared with the trajectory planning algorithm based on pseudospectral method, the proposed method has the generalization ability to meet the accuracy requirements and can meet the needs of online real-time trajectory planning.
KW - deep reinforcement learning
KW - hypersonic vehicle
KW - proximal policy optimization
KW - Reentry trajectory planning
UR - http://www.scopus.com/inward/record.url?scp=85192530693&partnerID=8YFLogxK
U2 - 10.1007/978-981-97-1107-9_13
DO - 10.1007/978-981-97-1107-9_13
M3 - Conference contribution
AN - SCOPUS:85192530693
SN - 9789819711062
T3 - Lecture Notes in Electrical Engineering
SP - 144
EP - 153
BT - Proceedings of 3rd 2023 International Conference on Autonomous Unmanned Systems (3rd ICAUS 2023) - Volume I
A2 - Qu, Yi
A2 - Gu, Mancang
A2 - Niu, Yifeng
A2 - Fu, Wenxing
PB - Springer Science and Business Media Deutschland GmbH
T2 - 3rd International Conference on Autonomous Unmanned Systems, ICAUS 2023
Y2 - 9 September 2023 through 11 September 2023
ER -