Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning

Jingang Zhao, Minggang Gan*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

13 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 13
  • Captures
    • Readers: 17
see details

摘要

This paper investigates finite-horizon optimal control problem of continuous-time uncertain nonlinear systems. The uncertainty here refers to partially unknown system dynamics. Unlike the infinite-horizon, the difficulty of finite-horizon optimal control problem is that the Hamilton–Jacobi–Bellman (HJB) equation is time-varying and must meet certain terminal boundary constraints, which brings greater challenges. At the same time, the partially unknown system dynamics have also caused additional difficulties. The main innovation of this paper is the proposed cyclic fixed-finite-horizon-based reinforcement learning algorithm to approximately solve the time-varying HJB equation. The proposed algorithm mainly consists of two phases: the data collection phase over a fixed-finite-horizon and the parameters update phase. A least-squares method is used to correlate the two phases to obtain the optimal parameters by cyclic. Finally, simulation results are given to verify the effectiveness of the proposed cyclic fixed-finite-horizon-based reinforcement learning algorithm.

源语言英语
页(从-至)2429-2440
页数12
期刊International Journal of Systems Science
51
13
DOI
出版状态已出版 - 2 10月 2020

指纹

探究 'Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhao, J., & Gan, M. (2020). Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning. International Journal of Systems Science, 51(13), 2429-2440. https://doi.org/10.1080/00207721.2020.1797223