Neural network learning control of multi-input system with unknown dynamics

Yongfeng Lv, Xuemei Ren, Siqi Li, Huichao Li, Hengxing Lv

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

There are few studies on the optimal control of the multi-input system with different input dynamics in the literature. For this problem, the learning Nash controllers are obtained with a simplified-reinforcement learning (SRL) scheme and Nonzero-sum game theory. A neural network (NN) identifier is first established to approximate the unknown multi-input system. Then SRL NNs are used to approximate the optimal performance index of each input, which is used to learn the optimal control policies for the multi-input system. The weights of the NN architecture are tuned with a novel algorithm, and the parameter convergences are analyzed to be uniformly ultimately bounded. Finally, one two-input nonlinear system is introduced to verify the proposed learning control scheme.

Original languageEnglish
Title of host publicationProceedings of 2019 6th IEEE International Conference on Cloud Computing and Intelligence Systems, CCIS 2019
EditorsXizhao Wang, Weining Wang, Xiangnan He
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages169-173
Number of pages5
ISBN (Electronic)9781728138633
DOIs
Publication statusPublished - Dec 2019
Event6th IEEE International Conference on Cloud Computing and Intelligence Systems, CCIS 2019 - Singapore, Singapore
Duration: 19 Dec 201921 Dec 2019

Publication series

NameProceedings of 2019 6th IEEE International Conference on Cloud Computing and Intelligence Systems, CCIS 2019

Conference

Conference6th IEEE International Conference on Cloud Computing and Intelligence Systems, CCIS 2019
Country/TerritorySingapore
CitySingapore
Period19/12/1921/12/19

Keywords

  • Multi-input System
  • Neural Networks
  • Optimal Control
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'Neural network learning control of multi-input system with unknown dynamics'. Together they form a unique fingerprint.

Cite this