An Adaptive Dynamic Programming Algorithm Based on ITF-OELM for Discrete-Time Systems

Xiaofei Zhang, Hongbin Ma*, Junyong Chen, Weixue Li

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Adaptive dynamic programming (ADP) is an intelligent control method. It is non-model-based and can directly approximate the optimal control policy via online learning. A gradient algorithm is usually used to update the weights of the action networks and critic networks. However, gradient descent-based learning methods are generally very slow when the learning step is chosen improperly, and they may easily converge to a local minimum. To overcome these disadvantages, this paper introduces a novel ADP algorithm based on the initial-training-free online extreme learning machine (ITF-OELM), in which the critic network weights linking hidden nodes to output nodes are obtained by least squares instead of a gradient algorithm. The ADP algorithm based on ITF-OELM is tested on a discrete-time torsional pendulum system, and simulation results indicate that it makes the system converge in a shorter time than ADP based on the gradient algorithm.
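The abstract does not give the ITF-OELM update equations, so the following is only a minimal sketch of the general idea it describes: hidden-layer weights are fixed at random, and the hidden-to-output weights are fitted by least squares, here in a recursive (online) form that starts from zero weights with no batch initialization. The class name `OnlineELM`, the `tanh` activation, the initial covariance scale `p0`, and the quadratic target standing in for a critic training signal are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random


class OnlineELM:
    """Hypothetical single-output ELM with recursive least-squares output weights."""

    def __init__(self, n_inputs, n_hidden, seed=0, p0=1e3):
        rng = random.Random(seed)
        # Hidden-layer weights and biases are drawn once and never trained.
        self.w = [[rng.uniform(-1.0, 1.0) for _ in range(n_inputs)]
                  for _ in range(n_hidden)]
        self.b = [rng.uniform(-1.0, 1.0) for _ in range(n_hidden)]
        # Output weights start at zero (assumption: no initial batch phase).
        self.beta = [0.0] * n_hidden
        # P starts as p0 * I and tracks the inverse of (H^T H + I / p0).
        self.p = [[p0 if i == j else 0.0 for j in range(n_hidden)]
                  for i in range(n_hidden)]

    def hidden(self, x):
        """Random-feature hidden layer output for one input vector."""
        return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
                for row, bi in zip(self.w, self.b)]

    def predict(self, x):
        return sum(hi * bi for hi, bi in zip(self.hidden(x), self.beta))

    def update(self, x, target):
        """One recursive least-squares step; returns the pre-update error."""
        h = self.hidden(x)
        ph = [sum(pij * hj for pij, hj in zip(row, h)) for row in self.p]
        denom = 1.0 + sum(hi * phi for hi, phi in zip(h, ph))
        gain = [phi / denom for phi in ph]
        err = target - sum(hi * bi for hi, bi in zip(h, self.beta))
        self.beta = [bi + gi * err for bi, gi in zip(self.beta, gain)]
        # P <- P - gain * (P h)^T, valid because P stays symmetric.
        self.p = [[pij - gi * phj for pij, phj in zip(row, ph)]
                  for row, gi in zip(self.p, gain)]
        return err


def cost(x):
    """Illustrative quadratic target, standing in for a critic training signal."""
    return x[0] ** 2 + x[1] ** 2


rng = random.Random(1)
test_points = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(200)]
model = OnlineELM(n_inputs=2, n_hidden=15)


def mean_abs_error(m):
    return sum(abs(cost(x) - m.predict(x)) for x in test_points) / len(test_points)


before = mean_abs_error(model)
for _ in range(500):
    x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    model.update(x, cost(x))
after = mean_abs_error(model)
print(before, after)
```

Each call to `update` solves the regularized least-squares problem for all samples seen so far without choosing a learning step, which is the property the abstract contrasts with gradient descent. A full ADP loop would also need an action network and a system model or measured transitions, which this sketch leaves out.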

Original language: English
Title of host publication: Proceedings of the 33rd Chinese Control and Decision Conference, CCDC 2021
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 3006-3011
Number of pages: 6
ISBN (Electronic): 9781665440899
DOIs
Publication status: Published - 2021
Event: 33rd Chinese Control and Decision Conference, CCDC 2021 - Kunming, China
Duration: 22 May 2021 to 24 May 2021

Publication series

Name: Proceedings of the 33rd Chinese Control and Decision Conference, CCDC 2021

Conference

Conference: 33rd Chinese Control and Decision Conference, CCDC 2021
Country/Territory: China
City: Kunming
Period: 22/05/21 to 24/05/21

Keywords

  • Adaptive Dynamic Programming
  • Discrete-time Systems
  • Extreme Learning Machine
