Robust optimal control of the multi-input systems with unknown disturbance based on adaptive integral reinforcement learning Q-function

Yongfeng Lv; Jun Zhao; Rong Li; Xuemei Ren

doi:10.1002/rnc.7191

Robust optimal control of the multi-input systems with unknown disturbance based on adaptive integral reinforcement learning Q-function

Yongfeng Lv^*, Jun Zhao, Rong Li, Xuemei Ren

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

Considering overshoot and chatter caused by the unknown interference, this article studies the adaptive robust optimal controls of continuous-time (CT) multi-input systems with an approximate dynamic programming (ADP) based Q-function scheme. An adaptive integral reinforcement learning (IRL) scheme is proposed to study the optimal solutions of Q-functions. First, multi-input value functions are presented, and Nash equilibrium is analyzed. A complex Hamilton–Jacobi–Issacs (HJI) equation is constructed with the multi-input system and the zero-sum-game-based value function. It is a challenging task to solve the HJI equation for nonlinear system. Thus, A transformation of the HJI equation is constructed as a Q-function. The neural network (NN) is applied to learn the solution of the transformed Q-functions based on the adaptive IRL scheme. Moreover, an error information is added to the Q-function for the issue of insufficient initial incentives to relax the persistent excitation (PE) condition. Simultaneously, an IRL signal of the critic networks is introduced to study the saddle-point intractable solution, such that the system drift and NN derivatives in the HJI equation are relaxed. The convergence of weight parameters is proved, and the closed-loop stability of the multi-system with the proposed IRL Q-function scheme is analyzed. Finally, a two-engine driven F-16 aircraft plant and a nonlinear system are presented to verify the effectiveness of the proposed adaptive IRL Q-function scheme.

Original language	English
Pages (from-to)	4234-4251
Number of pages	18
Journal	International Journal of Robust and Nonlinear Control
Volume	34
Issue number	6
DOIs	https://doi.org/10.1002/rnc.7191
Publication status	Published - Apr 2024

Keywords

H∞ control
Q-function
integral reinforcement learning
neural network

Access to Document

10.1002/rnc.7191

Cite this

@article{9c1c863f14e647768c62ffcc24c6e7f5,

title = "Robust optimal control of the multi-input systems with unknown disturbance based on adaptive integral reinforcement learning Q-function",

abstract = "Considering overshoot and chatter caused by the unknown interference, this article studies the adaptive robust optimal controls of continuous-time (CT) multi-input systems with an approximate dynamic programming (ADP) based Q-function scheme. An adaptive integral reinforcement learning (IRL) scheme is proposed to study the optimal solutions of Q-functions. First, multi-input value functions are presented, and Nash equilibrium is analyzed. A complex Hamilton–Jacobi–Issacs (HJI) equation is constructed with the multi-input system and the zero-sum-game-based value function. It is a challenging task to solve the HJI equation for nonlinear system. Thus, A transformation of the HJI equation is constructed as a Q-function. The neural network (NN) is applied to learn the solution of the transformed Q-functions based on the adaptive IRL scheme. Moreover, an error information is added to the Q-function for the issue of insufficient initial incentives to relax the persistent excitation (PE) condition. Simultaneously, an IRL signal of the critic networks is introduced to study the saddle-point intractable solution, such that the system drift and NN derivatives in the HJI equation are relaxed. The convergence of weight parameters is proved, and the closed-loop stability of the multi-system with the proposed IRL Q-function scheme is analyzed. Finally, a two-engine driven F-16 aircraft plant and a nonlinear system are presented to verify the effectiveness of the proposed adaptive IRL Q-function scheme.",

keywords = "H∞ control, Q-function, integral reinforcement learning, neural network",

author = "Yongfeng Lv and Jun Zhao and Rong Li and Xuemei Ren",

note = "Publisher Copyright: {\textcopyright} 2024 John Wiley & Sons Ltd.",

year = "2024",

month = apr,

doi = "10.1002/rnc.7191",

language = "English",

volume = "34",

pages = "4234--4251",

journal = "International Journal of Robust and Nonlinear Control",

issn = "1049-8923",

publisher = "John Wiley and Sons Ltd",

number = "6",

}

TY - JOUR

T1 - Robust optimal control of the multi-input systems with unknown disturbance based on adaptive integral reinforcement learning Q-function

AU - Lv, Yongfeng

AU - Zhao, Jun

AU - Li, Rong

AU - Ren, Xuemei

PY - 2024/4

Y1 - 2024/4

N2 - Considering overshoot and chatter caused by the unknown interference, this article studies the adaptive robust optimal controls of continuous-time (CT) multi-input systems with an approximate dynamic programming (ADP) based Q-function scheme. An adaptive integral reinforcement learning (IRL) scheme is proposed to study the optimal solutions of Q-functions. First, multi-input value functions are presented, and Nash equilibrium is analyzed. A complex Hamilton–Jacobi–Issacs (HJI) equation is constructed with the multi-input system and the zero-sum-game-based value function. It is a challenging task to solve the HJI equation for nonlinear system. Thus, A transformation of the HJI equation is constructed as a Q-function. The neural network (NN) is applied to learn the solution of the transformed Q-functions based on the adaptive IRL scheme. Moreover, an error information is added to the Q-function for the issue of insufficient initial incentives to relax the persistent excitation (PE) condition. Simultaneously, an IRL signal of the critic networks is introduced to study the saddle-point intractable solution, such that the system drift and NN derivatives in the HJI equation are relaxed. The convergence of weight parameters is proved, and the closed-loop stability of the multi-system with the proposed IRL Q-function scheme is analyzed. Finally, a two-engine driven F-16 aircraft plant and a nonlinear system are presented to verify the effectiveness of the proposed adaptive IRL Q-function scheme.

AB - Considering overshoot and chatter caused by the unknown interference, this article studies the adaptive robust optimal controls of continuous-time (CT) multi-input systems with an approximate dynamic programming (ADP) based Q-function scheme. An adaptive integral reinforcement learning (IRL) scheme is proposed to study the optimal solutions of Q-functions. First, multi-input value functions are presented, and Nash equilibrium is analyzed. A complex Hamilton–Jacobi–Issacs (HJI) equation is constructed with the multi-input system and the zero-sum-game-based value function. It is a challenging task to solve the HJI equation for nonlinear system. Thus, A transformation of the HJI equation is constructed as a Q-function. The neural network (NN) is applied to learn the solution of the transformed Q-functions based on the adaptive IRL scheme. Moreover, an error information is added to the Q-function for the issue of insufficient initial incentives to relax the persistent excitation (PE) condition. Simultaneously, an IRL signal of the critic networks is introduced to study the saddle-point intractable solution, such that the system drift and NN derivatives in the HJI equation are relaxed. The convergence of weight parameters is proved, and the closed-loop stability of the multi-system with the proposed IRL Q-function scheme is analyzed. Finally, a two-engine driven F-16 aircraft plant and a nonlinear system are presented to verify the effectiveness of the proposed adaptive IRL Q-function scheme.

KW - H∞ control

KW - Q-function

KW - integral reinforcement learning

KW - neural network

UR - http://www.scopus.com/inward/record.url?scp=85181715168&partnerID=8YFLogxK

U2 - 10.1002/rnc.7191

DO - 10.1002/rnc.7191

M3 - Article

AN - SCOPUS:85181715168

SN - 1049-8923

VL - 34

SP - 4234

EP - 4251

JO - International Journal of Robust and Nonlinear Control

JF - International Journal of Robust and Nonlinear Control

IS - 6

ER -

Robust optimal control of the multi-input systems with unknown disturbance based on adaptive integral reinforcement learning Q-function

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this