Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning

Jingang Zhao, Minggang Gan*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

This paper investigates finite-horizon optimal control problem of continuous-time uncertain nonlinear systems. The uncertainty here refers to partially unknown system dynamics. Unlike the infinite-horizon, the difficulty of finite-horizon optimal control problem is that the Hamilton–Jacobi–Bellman (HJB) equation is time-varying and must meet certain terminal boundary constraints, which brings greater challenges. At the same time, the partially unknown system dynamics have also caused additional difficulties. The main innovation of this paper is the proposed cyclic fixed-finite-horizon-based reinforcement learning algorithm to approximately solve the time-varying HJB equation. The proposed algorithm mainly consists of two phases: the data collection phase over a fixed-finite-horizon and the parameters update phase. A least-squares method is used to correlate the two phases to obtain the optimal parameters by cyclic. Finally, simulation results are given to verify the effectiveness of the proposed cyclic fixed-finite-horizon-based reinforcement learning algorithm.

Original languageEnglish
Pages (from-to)2429-2440
Number of pages12
JournalInternational Journal of Systems Science
Volume51
Issue number13
DOIs
Publication statusPublished - 2 Oct 2020

Keywords

  • Finite-horizon
  • continuous-time
  • optimal control
  • reinforcement learning
  • uncertain nonlinear systems

Fingerprint

Dive into the research topics of 'Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning'. Together they form a unique fingerprint.

Cite this