Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure

Yongfeng Lv; Jing Na; Xuemei Ren

doi:10.1080/00207179.2017.1381763

Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure

Yongfeng Lv, Jing Na^*, Xuemei Ren

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

36 Citations (Scopus)

Abstract

In this paper, we propose an identifier–critic-based approximate dynamic programming (ADP) structure to online solve H∞ control problem of nonlinear continuous-time systems without knowing precise system dynamics, where the actor neural network (NN) that has been widely used in the standard ADP learning structure is avoided. We first use an identifier NN to approximate the completely unknown nonlinear system dynamics and disturbances. Then, another critic NN is proposed to approximate the solution of the induced optimal equation. The H∞ control pair is obtained by using the proposed identifier–critic ADP structure. A recently developed adaptation algorithm is used to online directly estimate the unknown NN weights simultaneously, where the convergence to the optimal solution can be rigorously guaranteed, and the stability of the closed-loop system is analysed. Thus, this new ADP scheme can improve the computational efficiency of H∞ control implementation. Finally, simulation results confirm the effectiveness of the proposed methods.

Original language	English
Pages (from-to)	100-111
Number of pages	12
Journal	International Journal of Control
Volume	92
Issue number	1
DOIs	https://doi.org/10.1080/00207179.2017.1381763
Publication status	Published - 2 Jan 2019

Keywords

Approximate dynamic programming
H∞ control
neural networks
nonlinear systems
system identification

Access to Document

10.1080/00207179.2017.1381763

Cite this

@article{e9c5951c640d447187a56110e1fd4428,

title = "Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure",

abstract = "In this paper, we propose an identifier–critic-based approximate dynamic programming (ADP) structure to online solve H∞ control problem of nonlinear continuous-time systems without knowing precise system dynamics, where the actor neural network (NN) that has been widely used in the standard ADP learning structure is avoided. We first use an identifier NN to approximate the completely unknown nonlinear system dynamics and disturbances. Then, another critic NN is proposed to approximate the solution of the induced optimal equation. The H∞ control pair is obtained by using the proposed identifier–critic ADP structure. A recently developed adaptation algorithm is used to online directly estimate the unknown NN weights simultaneously, where the convergence to the optimal solution can be rigorously guaranteed, and the stability of the closed-loop system is analysed. Thus, this new ADP scheme can improve the computational efficiency of H∞ control implementation. Finally, simulation results confirm the effectiveness of the proposed methods.",

keywords = "Approximate dynamic programming, H∞ control, neural networks, nonlinear systems, system identification",

author = "Yongfeng Lv and Jing Na and Xuemei Ren",

note = "Publisher Copyright: {\textcopyright} 2017, {\textcopyright} 2017 Informa UK Limited, trading as Taylor & Francis Group.",

year = "2019",

month = jan,

day = "2",

doi = "10.1080/00207179.2017.1381763",

language = "English",

volume = "92",

pages = "100--111",

journal = "International Journal of Control",

issn = "0020-7179",

publisher = "Taylor and Francis Ltd.",

number = "1",

}

TY - JOUR

T1 - Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure

AU - Lv, Yongfeng

AU - Na, Jing

AU - Ren, Xuemei

PY - 2019/1/2

Y1 - 2019/1/2

N2 - In this paper, we propose an identifier–critic-based approximate dynamic programming (ADP) structure to online solve H∞ control problem of nonlinear continuous-time systems without knowing precise system dynamics, where the actor neural network (NN) that has been widely used in the standard ADP learning structure is avoided. We first use an identifier NN to approximate the completely unknown nonlinear system dynamics and disturbances. Then, another critic NN is proposed to approximate the solution of the induced optimal equation. The H∞ control pair is obtained by using the proposed identifier–critic ADP structure. A recently developed adaptation algorithm is used to online directly estimate the unknown NN weights simultaneously, where the convergence to the optimal solution can be rigorously guaranteed, and the stability of the closed-loop system is analysed. Thus, this new ADP scheme can improve the computational efficiency of H∞ control implementation. Finally, simulation results confirm the effectiveness of the proposed methods.

AB - In this paper, we propose an identifier–critic-based approximate dynamic programming (ADP) structure to online solve H∞ control problem of nonlinear continuous-time systems without knowing precise system dynamics, where the actor neural network (NN) that has been widely used in the standard ADP learning structure is avoided. We first use an identifier NN to approximate the completely unknown nonlinear system dynamics and disturbances. Then, another critic NN is proposed to approximate the solution of the induced optimal equation. The H∞ control pair is obtained by using the proposed identifier–critic ADP structure. A recently developed adaptation algorithm is used to online directly estimate the unknown NN weights simultaneously, where the convergence to the optimal solution can be rigorously guaranteed, and the stability of the closed-loop system is analysed. Thus, this new ADP scheme can improve the computational efficiency of H∞ control implementation. Finally, simulation results confirm the effectiveness of the proposed methods.

KW - Approximate dynamic programming

KW - H∞ control

KW - neural networks

KW - nonlinear systems

KW - system identification

UR - http://www.scopus.com/inward/record.url?scp=85031503146&partnerID=8YFLogxK

U2 - 10.1080/00207179.2017.1381763

DO - 10.1080/00207179.2017.1381763

M3 - Article

AN - SCOPUS:85031503146

SN - 0020-7179

VL - 92

SP - 100

EP - 111

JO - International Journal of Control

JF - International Journal of Control

IS - 1

ER -

Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this