TY - GEN
T1 - Off-Policy Q-Learning for Infinite Horizon LQR Problem with Unknown Dynamics
AU - Li, Xinxing
AU - Peng, Zhihong
AU - Liang, Li
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/8/10
Y1 - 2018/8/10
N2 - In this paper, a novel online Q-Iearning approach is proposed to solve the Infinite Horizon Linear Quadratic Regulator (IHLQR) problem for continuous-time (CT) linear time-invariant (LMI) systems. The proposed Q-Iearning algorithm employing off-policy reinforcement learning (RL) technology improves the exploration ability of Q-Iearning to the state space. During the learning process, the Q-Iearning algorithm can be implemented just using the data sets which just contains the information of the behavior policy and the corresponding system state, thus is data- driven. Moreover, the data sets can be used repeatedly, which is computationally efficient. A mild condition on probing noise is established to ensure the converge of the proposed Q-Iearning algorithm. Simulation results demonstrate the effectiveness of the developed algorithm.
AB - In this paper, a novel online Q-Iearning approach is proposed to solve the Infinite Horizon Linear Quadratic Regulator (IHLQR) problem for continuous-time (CT) linear time-invariant (LMI) systems. The proposed Q-Iearning algorithm employing off-policy reinforcement learning (RL) technology improves the exploration ability of Q-Iearning to the state space. During the learning process, the Q-Iearning algorithm can be implemented just using the data sets which just contains the information of the behavior policy and the corresponding system state, thus is data- driven. Moreover, the data sets can be used repeatedly, which is computationally efficient. A mild condition on probing noise is established to ensure the converge of the proposed Q-Iearning algorithm. Simulation results demonstrate the effectiveness of the developed algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85052365602&partnerID=8YFLogxK
U2 - 10.1109/ISIE.2018.8433684
DO - 10.1109/ISIE.2018.8433684
M3 - Conference contribution
AN - SCOPUS:85052365602
SN - 9781538637050
T3 - IEEE International Symposium on Industrial Electronics
SP - 258
EP - 263
BT - Proceedings - 2018 IEEE 27th International Symposium on Industrial Electronics, ISIE 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 27th IEEE International Symposium on Industrial Electronics, ISIE 2018
Y2 - 13 June 2018 through 15 June 2018
ER -