TY - GEN
T1 - Bipedal Walking Outdoors with a Point-footed Robot via Reinforcement Learning
AU - Zhang, Yuanxi
AU - Yu, Zhangguo
AU - Chen, Xuechao
AU - Du, Yidong
AU - Zhou, Zishun
AU - Gao, Junyao
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The bipedal robot can perfectly adapt to the scenes of human society without the need for additional structural design and motion planning. Although model-based control algorithms achieve multiple types of motion, the control robustness is still insufficient in the face of complex dynamic environments. Reinforcement learning (RL) methods have recently produced good results in the field of bipedal robots. In order to explore information and exploit it better in the robot state space, we introduced a transformer structure that is more sensitive to temporal information into the RL framework. Benefiting from the self-attention mechanism in the transformer, our observation scheme extracts more state features, allowing the bipedal robot to achieve robust outdoor locomotion, including bipedal walking on asphalt, marble and grass terrains, including the test of performance on the most commonly used asphalt according to quantitative data.
AB - The bipedal robot can perfectly adapt to the scenes of human society without the need for additional structural design and motion planning. Although model-based control algorithms achieve multiple types of motion, the control robustness is still insufficient in the face of complex dynamic environments. Reinforcement learning (RL) methods have recently produced good results in the field of bipedal robots. In order to explore information and exploit it better in the robot state space, we introduced a transformer structure that is more sensitive to temporal information into the RL framework. Benefiting from the self-attention mechanism in the transformer, our observation scheme extracts more state features, allowing the bipedal robot to achieve robust outdoor locomotion, including bipedal walking on asphalt, marble and grass terrains, including the test of performance on the most commonly used asphalt according to quantitative data.
KW - bipedal walking
KW - point-footed robot
KW - Reinforcement learning
KW - self-attention
UR - http://www.scopus.com/inward/record.url?scp=85219508903&partnerID=8YFLogxK
U2 - 10.1109/IRAC63143.2024.10871872
DO - 10.1109/IRAC63143.2024.10871872
M3 - Conference contribution
AN - SCOPUS:85219508903
T3 - 2024 International Conference on Intelligent Robotics and Automatic Control, IRAC 2024
SP - 193
EP - 197
BT - 2024 International Conference on Intelligent Robotics and Automatic Control, IRAC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2024 International Conference on Intelligent Robotics and Automatic Control, IRAC 2024
Y2 - 29 November 2024 through 1 December 2024
ER -