TY - JOUR
T1 - Adaptive optimal tracking controls of unknown multi-input systems based on nonzero-sum game theory
AU - Lv, Yongfeng
AU - Ren, Xuemei
AU - Na, Jing
N1 - Publisher Copyright:
© 2019 The Franklin Institute
PY - 2019/10
Y1 - 2019/10
N2 - This paper focuses on the optimal tracking control problem (OTCP) for the unknown multi-input system by using a reinforcement learning (RL) scheme and nonzero-sum (NZS) game theory. First, a generic method for the OTCP of multi-input systems is formulated with steady-state controls and optimal feedback controls based on the NZS game theory. Then a three-layer neural network (NN) identifier is introduced to approximate the unknown system, and the input dynamics are obtained by using the derivative of the identifier. To transform the OTCP into a regulation optimal problem, an augmentation of the multi-input system is constructed by using the tracking error and the commanded trajectory. Moreover, we use an NN-based RL method to online learn the optimal value functions of all the inputs, which are then directly used to calculate the optimal tracking controls. All the NN weights are tuned synchronously online with a newly introduced adaptation based on the estimation error. The convergence of the NN weights and the stability of the closed-loop system are analyzed. Finally, a two-motor driven servo system and another nonlinear system are presented to illustrate the feasibility of the algorithm for both linear and nonlinear multi-input systems.
AB - This paper focuses on the optimal tracking control problem (OTCP) for the unknown multi-input system by using a reinforcement learning (RL) scheme and nonzero-sum (NZS) game theory. First, a generic method for the OTCP of multi-input systems is formulated with steady-state controls and optimal feedback controls based on the NZS game theory. Then a three-layer neural network (NN) identifier is introduced to approximate the unknown system, and the input dynamics are obtained by using the derivative of the identifier. To transform the OTCP into a regulation optimal problem, an augmentation of the multi-input system is constructed by using the tracking error and the commanded trajectory. Moreover, we use an NN-based RL method to online learn the optimal value functions of all the inputs, which are then directly used to calculate the optimal tracking controls. All the NN weights are tuned synchronously online with a newly introduced adaptation based on the estimation error. The convergence of the NN weights and the stability of the closed-loop system are analyzed. Finally, a two-motor driven servo system and another nonlinear system are presented to illustrate the feasibility of the algorithm for both linear and nonlinear multi-input systems.
UR - http://www.scopus.com/inward/record.url?scp=85072355362&partnerID=8YFLogxK
U2 - 10.1016/j.jfranklin.2019.07.022
DO - 10.1016/j.jfranklin.2019.07.022
M3 - Article
AN - SCOPUS:85072355362
SN - 0016-0032
VL - 356
SP - 8255
EP - 8277
JO - Journal of the Franklin Institute
JF - Journal of the Franklin Institute
IS - 15
ER -