TY - GEN
T1 - Neural network tracking control of unknown servo system with approximate dynamic programming
AU - Lv, Yongfeng
AU - Ren, Xuemei
AU - Zeng, Tianyi
AU - Li, Linwei
AU - Na, Jing
N1 - Publisher Copyright:
© 2019 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2019/7
Y1 - 2019/7
N2 - Although the adaptive dynamic programming (ADP) scheme has been widely researched on the optimal problem in recent years, which has not been applied to the servo system. In this paper, a simplified reinforcement learning (RL) based (ADP) scheme is developed to obtain the optimal tracking control of the servo system, where the unknown system dynamics are approximated with a three-layer neural network (NN) identifier. First, the servo system model is constructed and a three-layer NN identifier is used to approximate the unknown servo system. The NN weights of both the hidden layer and output layer are synchronously tuned with an adaptive gradient law. An RL-based critic NN is then used to learn the optimal cost function, and NN weights are updated by minimizing the squared Hamilton-Jacobi-Bellman (HJB) error. The optimal tracking control of the servomechanism is obtained based on the three-layer NN identifier and RL scheme, which can make the motor speed track the predefined command. Moreover, the convergence of the identifier and NN weights is proved. Finally, a servomechanism model is provided, which can illustrate the proposed methods.
AB - Although the adaptive dynamic programming (ADP) scheme has been widely researched on the optimal problem in recent years, which has not been applied to the servo system. In this paper, a simplified reinforcement learning (RL) based (ADP) scheme is developed to obtain the optimal tracking control of the servo system, where the unknown system dynamics are approximated with a three-layer neural network (NN) identifier. First, the servo system model is constructed and a three-layer NN identifier is used to approximate the unknown servo system. The NN weights of both the hidden layer and output layer are synchronously tuned with an adaptive gradient law. An RL-based critic NN is then used to learn the optimal cost function, and NN weights are updated by minimizing the squared Hamilton-Jacobi-Bellman (HJB) error. The optimal tracking control of the servomechanism is obtained based on the three-layer NN identifier and RL scheme, which can make the motor speed track the predefined command. Moreover, the convergence of the identifier and NN weights is proved. Finally, a servomechanism model is provided, which can illustrate the proposed methods.
KW - Adaptive Dynamic Programming
KW - Neural Networks
KW - Optimal Control
KW - Reinforcement Learning
KW - Servomechanisms
UR - http://www.scopus.com/inward/record.url?scp=85074391806&partnerID=8YFLogxK
U2 - 10.23919/ChiCC.2019.8865727
DO - 10.23919/ChiCC.2019.8865727
M3 - Conference contribution
AN - SCOPUS:85074391806
T3 - Chinese Control Conference, CCC
SP - 2460
EP - 2465
BT - Proceedings of the 38th Chinese Control Conference, CCC 2019
A2 - Fu, Minyue
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 38th Chinese Control Conference, CCC 2019
Y2 - 27 July 2019 through 30 July 2019
ER -