TY - GEN
T1 - Deep Reinforcement Learning Based Trajectory Planning for Hopping on Low-Gravity Asteroid Surface
AU - Lv, Chang
AU - Liang, Zixuan
AU - Zhu, Shengying
N1 - Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - In a small-body exploration mission, the rover may deviate from the expected target point due to the dispersion of delivery. This paper proposes a hopping trajectory planning method based on deep reinforcement learning to achieve a precision landing on a low-gravity surface. First, the dynamic model of the hopping rover is established. Then, the hopping scheme is proposed with the attitude angle and angular velocity as control variables. In order to rapidly solve the control variables, the deep reinforcement learning algorithm is utilized for the autonomous hopping trajectory planning. The landing process is divided into an approach and a deceleration stage, and two agents are trained according to the reward functions of the two stages. To achieve the expected attitude angle and angular velocity given by the agents' outputs, the control torque is solved using sliding mode control method. Finally, the hopping trajectory planning method are verified in a landing mission on low-gravity surface. The results show that the rover can reach and stop at the target by intelligent hopping under various initial conditions.
AB - In a small-body exploration mission, the rover may deviate from the expected target point due to the dispersion of delivery. This paper proposes a hopping trajectory planning method based on deep reinforcement learning to achieve a precision landing on a low-gravity surface. First, the dynamic model of the hopping rover is established. Then, the hopping scheme is proposed with the attitude angle and angular velocity as control variables. In order to rapidly solve the control variables, the deep reinforcement learning algorithm is utilized for the autonomous hopping trajectory planning. The landing process is divided into an approach and a deceleration stage, and two agents are trained according to the reward functions of the two stages. To achieve the expected attitude angle and angular velocity given by the agents' outputs, the control torque is solved using sliding mode control method. Finally, the hopping trajectory planning method are verified in a landing mission on low-gravity surface. The results show that the rover can reach and stop at the target by intelligent hopping under various initial conditions.
KW - Deep reinforcement learning
KW - Hopping rover
KW - Low-gravity surface
KW - Sliding mode control
KW - Trajectory planning
UR - http://www.scopus.com/inward/record.url?scp=85128075254&partnerID=8YFLogxK
U2 - 10.1109/CAC53003.2021.9728343
DO - 10.1109/CAC53003.2021.9728343
M3 - Conference contribution
AN - SCOPUS:85128075254
T3 - Proceeding - 2021 China Automation Congress, CAC 2021
SP - 5353
EP - 5358
BT - Proceeding - 2021 China Automation Congress, CAC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 China Automation Congress, CAC 2021
Y2 - 22 October 2021 through 24 October 2021
ER -