Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics

Yongfeng Lv, Xuemei Ren*, Jing Na

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

49 Citations (Scopus)

Abstract

In this paper, a data-driven approximate neural network (NN) learning scheme is developed to solve the multi-player nonzero-sum (NZS) game problem with completely unknown system dynamics. An augmented NN identifier based on a new parameter estimation algorithm is first established to approximate the completely unknown system dynamics. Then approximated dynamic programming (ADP) with neural networks is constructed to approximate the optimal solutions of the coupled Hamilton-Jacobi equations for each player. The approximated NN value functions are then used to synchronously calculate the optimal control policies for every player. The identifier and ADP NN weights are online updated with the system input-output data based on a novel adaptive law, which could achieve a faster convergence speed. Moreover, the convergence of all NN weights and the stability of the closed-loop system are proved based on the Lyapunov approach. Finally, a dual-driven servo motor system and a three-player nonlinear game system are simulated to verify the feasibility of the developed methods.

Original languageEnglish
Pages (from-to)87-97
Number of pages11
JournalNeurocomputing
Volume283
DOIs
Publication statusPublished - 29 Mar 2018

Keywords

  • Adaptive dynamic programming
  • Multi-player nonzero-sum games
  • Neural networks
  • Optimal control
  • System identification

Fingerprint

Dive into the research topics of 'Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics'. Together they form a unique fingerprint.

Cite this