Adaptive optimal control for a class of uncertain systems with saturating actuators and external disturbance using integral reinforcement learning

Jingang Zhao, Minggang Gan, Jie Chen, Dongyang Hou, Meng Zhang*, Yongqiang Bai

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

The problem of adaptive optimal control for a class of nonlinear uncertain systems with saturating actuators and external disturbance is investigated in this paper. Considering the saturating actuators, a non-quadratic cost function is adopted. The key of this optimal control problem is to find the solution to the Hamilton Jacobi Bellman equation (HJB). An online intergral reinforcement learning (IRL) algorithm based-Neural Network (NN) is given to approximate the solution. Unlike traditional integral reinforcement learning algorithms, data onto a period of time stored together with current data are used to update the neural network weights in place of persistence of excitation (PE) condition. This method overcomes the shortcomings of the PE condition which is not easy to be checked online. Finally, numerical examples are given to show the effectiveness of the proposed methods.

Original languageEnglish
Title of host publication2017 Asian Control Conference, ASCC 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1146-1151
Number of pages6
ISBN (Electronic)9781509015733
DOIs
Publication statusPublished - 7 Feb 2018
Event2017 11th Asian Control Conference, ASCC 2017 - Gold Coast, Australia
Duration: 17 Dec 201720 Dec 2017

Publication series

Name2017 Asian Control Conference, ASCC 2017
Volume2018-January

Conference

Conference2017 11th Asian Control Conference, ASCC 2017
Country/TerritoryAustralia
CityGold Coast
Period17/12/1720/12/17

Fingerprint

Dive into the research topics of 'Adaptive optimal control for a class of uncertain systems with saturating actuators and external disturbance using integral reinforcement learning'. Together they form a unique fingerprint.

Cite this