Abstract
This paper investigates finite-horizon optimal control problem of continuous-time uncertain nonlinear systems. The uncertainty here refers to partially unknown system dynamics. Unlike the infinite-horizon, the difficulty of finite-horizon optimal control problem is that the Hamilton–Jacobi–Bellman (HJB) equation is time-varying and must meet certain terminal boundary constraints, which brings greater challenges. At the same time, the partially unknown system dynamics have also caused additional difficulties. The main innovation of this paper is the proposed cyclic fixed-finite-horizon-based reinforcement learning algorithm to approximately solve the time-varying HJB equation. The proposed algorithm mainly consists of two phases: the data collection phase over a fixed-finite-horizon and the parameters update phase. A least-squares method is used to correlate the two phases to obtain the optimal parameters by cyclic. Finally, simulation results are given to verify the effectiveness of the proposed cyclic fixed-finite-horizon-based reinforcement learning algorithm.
Original language | English |
---|---|
Pages (from-to) | 2429-2440 |
Number of pages | 12 |
Journal | International Journal of Systems Science |
Volume | 51 |
Issue number | 13 |
DOIs | |
Publication status | Published - 2 Oct 2020 |
Keywords
- Finite-horizon
- continuous-time
- optimal control
- reinforcement learning
- uncertain nonlinear systems