TY - JOUR
T1 - Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties
AU - Zhang, Yuanxi
AU - Chen, Xuechao
AU - Meng, Fei
AU - Yu, Zhangguo
AU - Du, Yidong
AU - Gao, Junyao
AU - Huang, Qiang
N1 - Publisher Copyright:
© 2024, Jilin University.
PY - 2024
Y1 - 2024
N2 - Reinforcement learning (RL) provides much potential for locomotion of legged robot. Due to the gap between simulation and the real world, achieving sim-to-real for legged robots is challenging. However, the support polygon of legged robots can help to overcome some of these challenges. Quadruped robot has a considerable support polygon, followed by bipedal robot with actuated feet, and point-footed bipedal robot has the smallest support polygon. Therefore, despite the existing sim-to-real gap, most of the recent RL approaches are deployed to the real quadruped robots that are inherently more stable, while the RL-based locomotion of bipedal robot is challenged by zero-shot sim-to-real task. Especially for the point-footed one that gets better dynamic performance, the inevitable tumble brings extra barriers to sim-to-real task. Actually, the crux of this type of problem is the difference of mechanics properties between the physical robot and the simulated one, making it difficult to play the learned skills well on the physical bipedal robot. In this paper, we introduce the embedded mechanics properties (EMP) based on the optimization with Gaussian processes to RL training, making it possible to perform sim-to-real transfer on the BRS1-P robot used in this work, hence the trained policy can be deployed on the BRS1-P without any struggle. We validate the performance of the learning-based BRS1-P on the condition of disturbances and terrains not ever learned, demonstrating the bipedal locomotion and resistant performance.
AB - Reinforcement learning (RL) provides much potential for locomotion of legged robot. Due to the gap between simulation and the real world, achieving sim-to-real for legged robots is challenging. However, the support polygon of legged robots can help to overcome some of these challenges. Quadruped robot has a considerable support polygon, followed by bipedal robot with actuated feet, and point-footed bipedal robot has the smallest support polygon. Therefore, despite the existing sim-to-real gap, most of the recent RL approaches are deployed to the real quadruped robots that are inherently more stable, while the RL-based locomotion of bipedal robot is challenged by zero-shot sim-to-real task. Especially for the point-footed one that gets better dynamic performance, the inevitable tumble brings extra barriers to sim-to-real task. Actually, the crux of this type of problem is the difference of mechanics properties between the physical robot and the simulated one, making it difficult to play the learned skills well on the physical bipedal robot. In this paper, we introduce the embedded mechanics properties (EMP) based on the optimization with Gaussian processes to RL training, making it possible to perform sim-to-real transfer on the BRS1-P robot used in this work, hence the trained policy can be deployed on the BRS1-P without any struggle. We validate the performance of the learning-based BRS1-P on the condition of disturbances and terrains not ever learned, demonstrating the bipedal locomotion and resistant performance.
KW - Bipedal robot
KW - Mechanics properties
KW - Reinforcement learning
KW - Sim-to-real
UR - http://www.scopus.com/inward/record.url?scp=85182678076&partnerID=8YFLogxK
U2 - 10.1007/s42235-023-00452-9
DO - 10.1007/s42235-023-00452-9
M3 - Article
AN - SCOPUS:85182678076
SN - 1672-6529
JO - Journal of Bionic Engineering
JF - Journal of Bionic Engineering
ER -