Online Behavior-Centric Adaptation for Bipedal Robot Sim-to-Real Transfer With Unmodeled Dynamics Mismatch

  • Xuechao Chen
  • , Yidong Du
  • , Zishun Zhou
  • , Zhicheng Yuan
  • , Qingrui Zhao
  • , Fei Meng*
  • , Zhangguo Yu
  • , Peng Lu
  • , Qiang Huang
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Bipedal robots have achieved remarkable locomotion capabilities through reinforcement learning (RL), yet their real-world deployment remains hindered by the sim-to-real gap—dynamics mismatches between simulation and reality that degrade locomotion performance through behavioral deviations. This work introduces an online behavior adaptation framework that bridges this gap at the behavioral level by dynamically aligning emergent locomotion strategies with simulation-derived objectives. Our method integrates two core innovations: (1) a structured latent space constructed via an augmented Variational Autoencoder (VAE), which quantifies behavioral divergence through domain-invariant representations of locomotion patterns, and (2) a closed-loop adaptation module that maps latent-space deviations to real-time adjustments in low-level controller parameters. By reformulating sim-to-real transfer as a problem of behavioral alignment rather than explicit dynamics matching, the framework enables continuous adaptation to unmodeled dynamics mismatch without requiring system identification or offline retraining. Extensive experimental evaluations demonstrate the effectiveness of the proposed method, highlighting its potential to bridge the behavior gap between simulation and reality. Note to Practitioners—This paper was motivated by the problem that the legged robot locomotion with learning-based controller could suffer performance drop due to the sim-to-real gap, which is directly reflected on the behavior deviation between the simulated and real-world robot. Traditional methods require heavy man-crafted parameter tuning. This paper proposes a novel online behavior adaptation framework to alleviate the sim-to-real gap, using a trained robot behavior encoding network and a behavior adaptation network. This framework enables the robot to detects behavioral deviations and adjusts low-level control parameters automatically. The simulated and real-world experiments suggest the proposed framework is feasible to reduce the behavior deviation of real-world robot locomotion with the simulated robot with unmodeled dynamics mismatch by self-correcting locomotion strategies in real time when faced with unexpected disturbances, and reduce manual tuning efforts. But it has not yet been validated on complex terrains, so in future research, we will further validate the proposed framework on complex terrains.

Original languageEnglish
Pages (from-to)1533-1545
Number of pages13
JournalIEEE Transactions on Automation Science and Engineering
Volume23
DOIs
Publication statusPublished - 2026
Externally publishedYes

Keywords

  • Reinforcement learning
  • bipedal robot
  • sim-to-real

Fingerprint

Dive into the research topics of 'Online Behavior-Centric Adaptation for Bipedal Robot Sim-to-Real Transfer With Unmodeled Dynamics Mismatch'. Together they form a unique fingerprint.

Cite this