Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics

Yongfeng Lv; Xuemei Ren; Jing Na

doi:10.1016/j.neucom.2017.12.045

Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics

Yongfeng Lv, Xuemei Ren^*, Jing Na

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

51 Citations (Scopus)

Abstract

In this paper, a data-driven approximate neural network (NN) learning scheme is developed to solve the multi-player nonzero-sum (NZS) game problem with completely unknown system dynamics. An augmented NN identifier based on a new parameter estimation algorithm is first established to approximate the completely unknown system dynamics. Then approximated dynamic programming (ADP) with neural networks is constructed to approximate the optimal solutions of the coupled Hamilton-Jacobi equations for each player. The approximated NN value functions are then used to synchronously calculate the optimal control policies for every player. The identifier and ADP NN weights are online updated with the system input-output data based on a novel adaptive law, which could achieve a faster convergence speed. Moreover, the convergence of all NN weights and the stability of the closed-loop system are proved based on the Lyapunov approach. Finally, a dual-driven servo motor system and a three-player nonlinear game system are simulated to verify the feasibility of the developed methods.

Original language	English
Pages (from-to)	87-97
Number of pages	11
Journal	Neurocomputing
Volume	283
DOIs	https://doi.org/10.1016/j.neucom.2017.12.045
Publication status	Published - 29 Mar 2018

Keywords

Adaptive dynamic programming
Multi-player nonzero-sum games
Neural networks
Optimal control
System identification

Access to Document

10.1016/j.neucom.2017.12.045

Cite this

@article{1a9c76cfdd72400cbd515442053e1008,

title = "Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics",

abstract = "In this paper, a data-driven approximate neural network (NN) learning scheme is developed to solve the multi-player nonzero-sum (NZS) game problem with completely unknown system dynamics. An augmented NN identifier based on a new parameter estimation algorithm is first established to approximate the completely unknown system dynamics. Then approximated dynamic programming (ADP) with neural networks is constructed to approximate the optimal solutions of the coupled Hamilton-Jacobi equations for each player. The approximated NN value functions are then used to synchronously calculate the optimal control policies for every player. The identifier and ADP NN weights are online updated with the system input-output data based on a novel adaptive law, which could achieve a faster convergence speed. Moreover, the convergence of all NN weights and the stability of the closed-loop system are proved based on the Lyapunov approach. Finally, a dual-driven servo motor system and a three-player nonlinear game system are simulated to verify the feasibility of the developed methods.",

keywords = "Adaptive dynamic programming, Multi-player nonzero-sum games, Neural networks, Optimal control, System identification",

author = "Yongfeng Lv and Xuemei Ren and Jing Na",

note = "Publisher Copyright: {\textcopyright} 2017",

year = "2018",

month = mar,

day = "29",

doi = "10.1016/j.neucom.2017.12.045",

language = "English",

volume = "283",

pages = "87--97",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics

AU - Lv, Yongfeng

AU - Ren, Xuemei

AU - Na, Jing

PY - 2018/3/29

Y1 - 2018/3/29

N2 - In this paper, a data-driven approximate neural network (NN) learning scheme is developed to solve the multi-player nonzero-sum (NZS) game problem with completely unknown system dynamics. An augmented NN identifier based on a new parameter estimation algorithm is first established to approximate the completely unknown system dynamics. Then approximated dynamic programming (ADP) with neural networks is constructed to approximate the optimal solutions of the coupled Hamilton-Jacobi equations for each player. The approximated NN value functions are then used to synchronously calculate the optimal control policies for every player. The identifier and ADP NN weights are online updated with the system input-output data based on a novel adaptive law, which could achieve a faster convergence speed. Moreover, the convergence of all NN weights and the stability of the closed-loop system are proved based on the Lyapunov approach. Finally, a dual-driven servo motor system and a three-player nonlinear game system are simulated to verify the feasibility of the developed methods.

AB - In this paper, a data-driven approximate neural network (NN) learning scheme is developed to solve the multi-player nonzero-sum (NZS) game problem with completely unknown system dynamics. An augmented NN identifier based on a new parameter estimation algorithm is first established to approximate the completely unknown system dynamics. Then approximated dynamic programming (ADP) with neural networks is constructed to approximate the optimal solutions of the coupled Hamilton-Jacobi equations for each player. The approximated NN value functions are then used to synchronously calculate the optimal control policies for every player. The identifier and ADP NN weights are online updated with the system input-output data based on a novel adaptive law, which could achieve a faster convergence speed. Moreover, the convergence of all NN weights and the stability of the closed-loop system are proved based on the Lyapunov approach. Finally, a dual-driven servo motor system and a three-player nonlinear game system are simulated to verify the feasibility of the developed methods.

KW - Adaptive dynamic programming

KW - Multi-player nonzero-sum games

KW - Neural networks

KW - Optimal control

KW - System identification

UR - http://www.scopus.com/inward/record.url?scp=85039926771&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2017.12.045

DO - 10.1016/j.neucom.2017.12.045

M3 - Article

AN - SCOPUS:85039926771

SN - 0925-2312

VL - 283

SP - 87

EP - 97

JO - Neurocomputing

JF - Neurocomputing

ER -

Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this