An Adaptive Dynamic Programming Algorithm Based on ITF-OELM for Discrete-Time Systems

Xiaofei Zhang, Hongbin Ma*, Junyong Chen, Weixue Li

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Adaptive dynamic programming (ADP) is a kind of intelligent control method, and it is a non-model-based method that can directly approximate the optimal control policy via online learning. The gradient algorithm is usually used to update weights of action networks and critic networks, however it is clear that gradient descent-based learning methods are generally very slow due to improper learning steps or may easily converge to local minimum. In this paper, in order to overcome those disadvantages of gradient descent-based learning methods, a novel ADP algorithm based on initial-training-free online extreme learning machine (ITF-OELM), in which the critic network link weights of hidden nodes to output nodes can be obtained by least squares instead of gradient algorithm, is introduced. Finally, the ADP algorithm based on ITF-OELM is tested on a discrete time torsional pendulum system, and simulation results indicate that this algorithm makes the system converge in a shorter time compared with the ADP based on gradient algorithm.

源语言英语
主期刊名Proceedings of the 33rd Chinese Control and Decision Conference, CCDC 2021
出版商Institute of Electrical and Electronics Engineers Inc.
3006-3011
页数6
ISBN(电子版)9781665440899
DOI
出版状态已出版 - 2021
活动33rd Chinese Control and Decision Conference, CCDC 2021 - Kunming, 中国
期限: 22 5月 202124 5月 2021

出版系列

姓名Proceedings of the 33rd Chinese Control and Decision Conference, CCDC 2021

会议

会议33rd Chinese Control and Decision Conference, CCDC 2021
国家/地区中国
Kunming
时期22/05/2124/05/21

指纹

探究 'An Adaptive Dynamic Programming Algorithm Based on ITF-OELM for Discrete-Time Systems' 的科研主题。它们共同构成独一无二的指纹。

引用此