Deep Reinforcement Learning Based Trajectory Planning for Hopping on Low-Gravity Asteroid Surface

Chang Lv, Zixuan Liang*, Shengying Zhu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In a small-body exploration mission, the rover may deviate from the expected target point due to the dispersion of delivery. This paper proposes a hopping trajectory planning method based on deep reinforcement learning to achieve a precision landing on a low-gravity surface. First, the dynamic model of the hopping rover is established. Then, the hopping scheme is proposed with the attitude angle and angular velocity as control variables. In order to rapidly solve the control variables, the deep reinforcement learning algorithm is utilized for the autonomous hopping trajectory planning. The landing process is divided into an approach and a deceleration stage, and two agents are trained according to the reward functions of the two stages. To achieve the expected attitude angle and angular velocity given by the agents' outputs, the control torque is solved using sliding mode control method. Finally, the hopping trajectory planning method are verified in a landing mission on low-gravity surface. The results show that the rover can reach and stop at the target by intelligent hopping under various initial conditions.

Original languageEnglish
Title of host publicationProceeding - 2021 China Automation Congress, CAC 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5353-5358
Number of pages6
ISBN (Electronic)9781665426473
DOIs
Publication statusPublished - 2021
Event2021 China Automation Congress, CAC 2021 - Beijing, China
Duration: 22 Oct 202124 Oct 2021

Publication series

NameProceeding - 2021 China Automation Congress, CAC 2021

Conference

Conference2021 China Automation Congress, CAC 2021
Country/TerritoryChina
CityBeijing
Period22/10/2124/10/21

Keywords

  • Deep reinforcement learning
  • Hopping rover
  • Low-gravity surface
  • Sliding mode control
  • Trajectory planning

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning Based Trajectory Planning for Hopping on Low-Gravity Asteroid Surface'. Together they form a unique fingerprint.

Cite this