Skip to main navigation Skip to search Skip to main content

Time hopping technique for faster reinforcement learning in simulations

  • Petar Kormushev*
  • , Kohei Nomoto
  • , Fangyan Dong
  • , Kaoru Hirota
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

A technique called Time Hopping is proposed for speeding up reinforcement learning algorithms. It is applicable to continuous optimization problems running in computer simulations. Making shortcuts in time by hopping between distant states combined with off-policy reinforcement learning allows the technique to maintain higher learning rate. Experiments on a simulated biped crawling robot confirm that Time Hopping can accelerate the learning process more than seven times.

Original languageEnglish
Pages (from-to)42-59
Number of pages18
JournalCybernetics and Information Technologies
Volume11
Issue number3
Publication statusPublished - 2011
Externally publishedYes

Keywords

  • Biped robot
  • Computer simulation
  • Discrete time systems
  • Optimization methods
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Time hopping technique for faster reinforcement learning in simulations'. Together they form a unique fingerprint.

Cite this