TY - JOUR
T1 - Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure
AU - Lv, Yongfeng
AU - Na, Jing
AU - Ren, Xuemei
N1 - Publisher Copyright:
© 2017, © 2017 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2019/1/2
Y1 - 2019/1/2
N2 - In this paper, we propose an identifier–critic-based approximate dynamic programming (ADP) structure to online solve H∞ control problem of nonlinear continuous-time systems without knowing precise system dynamics, where the actor neural network (NN) that has been widely used in the standard ADP learning structure is avoided. We first use an identifier NN to approximate the completely unknown nonlinear system dynamics and disturbances. Then, another critic NN is proposed to approximate the solution of the induced optimal equation. The H∞ control pair is obtained by using the proposed identifier–critic ADP structure. A recently developed adaptation algorithm is used to online directly estimate the unknown NN weights simultaneously, where the convergence to the optimal solution can be rigorously guaranteed, and the stability of the closed-loop system is analysed. Thus, this new ADP scheme can improve the computational efficiency of H∞ control implementation. Finally, simulation results confirm the effectiveness of the proposed methods.
AB - In this paper, we propose an identifier–critic-based approximate dynamic programming (ADP) structure to online solve H∞ control problem of nonlinear continuous-time systems without knowing precise system dynamics, where the actor neural network (NN) that has been widely used in the standard ADP learning structure is avoided. We first use an identifier NN to approximate the completely unknown nonlinear system dynamics and disturbances. Then, another critic NN is proposed to approximate the solution of the induced optimal equation. The H∞ control pair is obtained by using the proposed identifier–critic ADP structure. A recently developed adaptation algorithm is used to online directly estimate the unknown NN weights simultaneously, where the convergence to the optimal solution can be rigorously guaranteed, and the stability of the closed-loop system is analysed. Thus, this new ADP scheme can improve the computational efficiency of H∞ control implementation. Finally, simulation results confirm the effectiveness of the proposed methods.
KW - Approximate dynamic programming
KW - H∞ control
KW - neural networks
KW - nonlinear systems
KW - system identification
UR - http://www.scopus.com/inward/record.url?scp=85031503146&partnerID=8YFLogxK
U2 - 10.1080/00207179.2017.1381763
DO - 10.1080/00207179.2017.1381763
M3 - Article
AN - SCOPUS:85031503146
SN - 0020-7179
VL - 92
SP - 100
EP - 111
JO - International Journal of Control
JF - International Journal of Control
IS - 1
ER -