Abstract
Considering the overshoot and chatter caused by unknown interference, this article studies the adaptive robust optimal control of continuous-time (CT) multi-input systems with an approximate dynamic programming (ADP) based Q-function scheme. An adaptive integral reinforcement learning (IRL) scheme is proposed to obtain the optimal solutions of the Q-functions. First, multi-input value functions are presented, and the Nash equilibrium is analyzed. A complex Hamilton–Jacobi–Isaacs (HJI) equation is constructed from the multi-input system and the zero-sum-game-based value function. Since solving the HJI equation for nonlinear systems is a challenging task, a transformation of the HJI equation is constructed as a Q-function. A neural network (NN) is applied to learn the solution of the transformed Q-function based on the adaptive IRL scheme. Moreover, an error-information term is added to the Q-function to address the issue of insufficient initial excitation and thereby relax the persistent excitation (PE) condition. Simultaneously, an IRL signal for the critic networks is introduced to approximate the intractable saddle-point solution, so that the system drift dynamics and NN derivatives in the HJI equation are not required. The convergence of the weight parameters is proved, and the closed-loop stability of the multi-input system under the proposed IRL Q-function scheme is analyzed. Finally, a two-engine-driven F-16 aircraft plant and a nonlinear system are presented to verify the effectiveness of the proposed adaptive IRL Q-function scheme.
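For context, the zero-sum-game formulation referred to in the abstract typically takes the following standard form in the ADP literature; the symbols below (state $x$, control $u$, disturbance $d$, attenuation level $\gamma$) follow common convention and are not necessarily the paper's exact notation:

```latex
% Zero-sum value function for dynamics \dot{x} = f(x) + g(x)u + k(x)d:
V(x(t)) = \int_{t}^{\infty} \left( x^{\top} Q x + u^{\top} R u - \gamma^{2} d^{\top} d \right) \mathrm{d}\tau

% The saddle-point (Nash) solution satisfies the HJI equation:
0 = \min_{u} \max_{d} \left[ \nabla V^{\top} \big( f(x) + g(x)u + k(x)d \big)
    + x^{\top} Q x + u^{\top} R u - \gamma^{2} d^{\top} d \right]
```

The Q-function transformation and IRL signal described in the abstract serve to solve this equation without requiring the drift dynamics $f(x)$.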
Original language | English |
---|---|
Pages (from–to) | 4234-4251 |
Number of pages | 18 |
Journal | International Journal of Robust and Nonlinear Control |
Volume | 34 |
Issue | 6 |
DOI | |
Publication status | Published - Apr 2024 |