Abstract
This paper develops an online adaptive critic algorithm based on policy iteration for partially unknown nonlinear optimal control with infinite horizon cost function. In the proposed method, only a critic network is established, which eliminates the action network, to simplify its architecture. The online least squares support vector machine (LS-SVM) is utilized to approximate the gradient of the associated cost function in the critic network by updating the input-output data. Additionally, a data buffer memory is added to alleviate computational load. Finally, the feasibility of the online learning algorithm is demonstrated in simulation on two example systems.
Original language | English |
---|---|
Pages (from-to) | 104-114 |
Number of pages | 11 |
Journal | Asian Journal of Control |
Volume | 20 |
Issue number | 1 |
DOIs | |
Publication status | Published - Jan 2018 |
Externally published | Yes |
Keywords
- LS-SVM
- adaptive critic
- data-driven
- nonlinear
- optimal control