Research on fast reinforcement learning

Liang Tong*, Ji Lian Lu, Jian Wei Gong

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Based on eligibility trace theory, a delayed fast reinforcement learning algorithm DFSARSA (λ) is proposed. By redefining the eligibility trace and tracking the TD (λ) error, the Q-value of reinforcement learning updates may be postponed when they are needed instead of update in each step as traditional SARSA (λ). The update computing complexity is reduced from O (|S| |A|) to O (|A|) compared with SARSA (λ) and the speed of the reinforcement learning is improved greatly. Simulation results show the method is validity.

Original languageEnglish
Pages (from-to)328-331
Number of pages4
JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume25
Issue number4
Publication statusPublished - Apr 2005

Keywords

  • DFSARSA (λ) algorithm
  • Eligibility trace
  • Reinforcement learning
  • SARSA (λ) algorithm

Fingerprint

Dive into the research topics of 'Research on fast reinforcement learning'. Together they form a unique fingerprint.

Cite this