Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties

Yuanxi Zhang; Xuechao Chen; Fei Meng; Zhangguo Yu; Yidong Du; Junyao Gao; Qiang Huang

doi:10.1007/s42235-023-00452-9

Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties

Yuanxi Zhang, Xuechao Chen, Fei Meng^*, Zhangguo Yu, Yidong Du, Junyao Gao, Qiang Huang

^*Corresponding author for this work

School of Mechatronical Engineering

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Reinforcement learning (RL) provides much potential for locomotion of legged robot. Due to the gap between simulation and the real world, achieving sim-to-real for legged robots is challenging. However, the support polygon of legged robots can help to overcome some of these challenges. Quadruped robot has a considerable support polygon, followed by bipedal robot with actuated feet, and point-footed bipedal robot has the smallest support polygon. Therefore, despite the existing sim-to-real gap, most of the recent RL approaches are deployed to the real quadruped robots that are inherently more stable, while the RL-based locomotion of bipedal robot is challenged by zero-shot sim-to-real task. Especially for the point-footed one that gets better dynamic performance, the inevitable tumble brings extra barriers to sim-to-real task. Actually, the crux of this type of problem is the difference of mechanics properties between the physical robot and the simulated one, making it difficult to play the learned skills well on the physical bipedal robot. In this paper, we introduce the embedded mechanics properties (EMP) based on the optimization with Gaussian processes to RL training, making it possible to perform sim-to-real transfer on the BRS1-P robot used in this work, hence the trained policy can be deployed on the BRS1-P without any struggle. We validate the performance of the learning-based BRS1-P on the condition of disturbances and terrains not ever learned, demonstrating the bipedal locomotion and resistant performance.

Original language	English
Journal	Journal of Bionic Engineering
DOIs	https://doi.org/10.1007/s42235-023-00452-9
Publication status	Accepted/In press - 2024

Keywords

Bipedal robot
Mechanics properties
Reinforcement learning
Sim-to-real

Access to Document

10.1007/s42235-023-00452-9

Cite this

Zhang, Y., Chen, X., Meng, F., Yu, Z., Du, Y., Gao, J., & Huang, Q. (Accepted/In press). Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties. Journal of Bionic Engineering. https://doi.org/10.1007/s42235-023-00452-9

@article{9a3c670c00dc4d5597934e4ae274fcfd,

title = "Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties",

abstract = "Reinforcement learning (RL) provides much potential for locomotion of legged robot. Due to the gap between simulation and the real world, achieving sim-to-real for legged robots is challenging. However, the support polygon of legged robots can help to overcome some of these challenges. Quadruped robot has a considerable support polygon, followed by bipedal robot with actuated feet, and point-footed bipedal robot has the smallest support polygon. Therefore, despite the existing sim-to-real gap, most of the recent RL approaches are deployed to the real quadruped robots that are inherently more stable, while the RL-based locomotion of bipedal robot is challenged by zero-shot sim-to-real task. Especially for the point-footed one that gets better dynamic performance, the inevitable tumble brings extra barriers to sim-to-real task. Actually, the crux of this type of problem is the difference of mechanics properties between the physical robot and the simulated one, making it difficult to play the learned skills well on the physical bipedal robot. In this paper, we introduce the embedded mechanics properties (EMP) based on the optimization with Gaussian processes to RL training, making it possible to perform sim-to-real transfer on the BRS1-P robot used in this work, hence the trained policy can be deployed on the BRS1-P without any struggle. We validate the performance of the learning-based BRS1-P on the condition of disturbances and terrains not ever learned, demonstrating the bipedal locomotion and resistant performance.",

keywords = "Bipedal robot, Mechanics properties, Reinforcement learning, Sim-to-real",

author = "Yuanxi Zhang and Xuechao Chen and Fei Meng and Zhangguo Yu and Yidong Du and Junyao Gao and Qiang Huang",

note = "Publisher Copyright: {\textcopyright} 2024, Jilin University.",

year = "2024",

doi = "10.1007/s42235-023-00452-9",

language = "English",

journal = "Journal of Bionic Engineering",

issn = "1672-6529",

publisher = "Springer",

}

TY - JOUR

T1 - Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties

AU - Zhang, Yuanxi

AU - Chen, Xuechao

AU - Meng, Fei

AU - Yu, Zhangguo

AU - Du, Yidong

AU - Gao, Junyao

AU - Huang, Qiang

PY - 2024

Y1 - 2024

N2 - Reinforcement learning (RL) provides much potential for locomotion of legged robot. Due to the gap between simulation and the real world, achieving sim-to-real for legged robots is challenging. However, the support polygon of legged robots can help to overcome some of these challenges. Quadruped robot has a considerable support polygon, followed by bipedal robot with actuated feet, and point-footed bipedal robot has the smallest support polygon. Therefore, despite the existing sim-to-real gap, most of the recent RL approaches are deployed to the real quadruped robots that are inherently more stable, while the RL-based locomotion of bipedal robot is challenged by zero-shot sim-to-real task. Especially for the point-footed one that gets better dynamic performance, the inevitable tumble brings extra barriers to sim-to-real task. Actually, the crux of this type of problem is the difference of mechanics properties between the physical robot and the simulated one, making it difficult to play the learned skills well on the physical bipedal robot. In this paper, we introduce the embedded mechanics properties (EMP) based on the optimization with Gaussian processes to RL training, making it possible to perform sim-to-real transfer on the BRS1-P robot used in this work, hence the trained policy can be deployed on the BRS1-P without any struggle. We validate the performance of the learning-based BRS1-P on the condition of disturbances and terrains not ever learned, demonstrating the bipedal locomotion and resistant performance.

AB - Reinforcement learning (RL) provides much potential for locomotion of legged robot. Due to the gap between simulation and the real world, achieving sim-to-real for legged robots is challenging. However, the support polygon of legged robots can help to overcome some of these challenges. Quadruped robot has a considerable support polygon, followed by bipedal robot with actuated feet, and point-footed bipedal robot has the smallest support polygon. Therefore, despite the existing sim-to-real gap, most of the recent RL approaches are deployed to the real quadruped robots that are inherently more stable, while the RL-based locomotion of bipedal robot is challenged by zero-shot sim-to-real task. Especially for the point-footed one that gets better dynamic performance, the inevitable tumble brings extra barriers to sim-to-real task. Actually, the crux of this type of problem is the difference of mechanics properties between the physical robot and the simulated one, making it difficult to play the learned skills well on the physical bipedal robot. In this paper, we introduce the embedded mechanics properties (EMP) based on the optimization with Gaussian processes to RL training, making it possible to perform sim-to-real transfer on the BRS1-P robot used in this work, hence the trained policy can be deployed on the BRS1-P without any struggle. We validate the performance of the learning-based BRS1-P on the condition of disturbances and terrains not ever learned, demonstrating the bipedal locomotion and resistant performance.

KW - Bipedal robot

KW - Mechanics properties

KW - Reinforcement learning

KW - Sim-to-real

UR - http://www.scopus.com/inward/record.url?scp=85182678076&partnerID=8YFLogxK

U2 - 10.1007/s42235-023-00452-9

DO - 10.1007/s42235-023-00452-9

M3 - Article

AN - SCOPUS:85182678076

SN - 1672-6529

JO - Journal of Bionic Engineering

JF - Journal of Bionic Engineering

ER -

Learning Robust Locomotion for Bipedal Robot via Embedded Mechanics Properties

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this