Inverse linear quadratic dynamic games using partial state observations

Chengpu Yu; Yao Li; Shukai Li; Jie Chen

doi:10.1016/j.automatica.2022.110534

Inverse linear quadratic dynamic games using partial state observations

Chengpu Yu^*, Yao Li, Shukai Li, Jie Chen

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.

Original language	English
Article number	110534
Journal	Automatica
Volume	145
DOIs	https://doi.org/10.1016/j.automatica.2022.110534
Publication status	Published - Nov 2022

Keywords

Causal-and-anticausal models
Data-driven identification
Two-player LQ games

Access to Document

10.1016/j.automatica.2022.110534

Cite this

@article{cc45cccdf59e4f8a9c4413b06a35d555,

title = "Inverse linear quadratic dynamic games using partial state observations",

abstract = "As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.",

keywords = "Causal-and-anticausal models, Data-driven identification, Two-player LQ games",

author = "Chengpu Yu and Yao Li and Shukai Li and Jie Chen",

note = "Publisher Copyright: {\textcopyright} 2022 Elsevier Ltd",

year = "2022",

month = nov,

doi = "10.1016/j.automatica.2022.110534",

language = "English",

volume = "145",

journal = "Automatica",

issn = "0005-1098",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Inverse linear quadratic dynamic games using partial state observations

AU - Yu, Chengpu

AU - Li, Yao

AU - Li, Shukai

AU - Chen, Jie

PY - 2022/11

Y1 - 2022/11

N2 - As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.

AB - As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.

KW - Causal-and-anticausal models

KW - Data-driven identification

KW - Two-player LQ games

UR - http://www.scopus.com/inward/record.url?scp=85136730550&partnerID=8YFLogxK

U2 - 10.1016/j.automatica.2022.110534

DO - 10.1016/j.automatica.2022.110534

M3 - Article

AN - SCOPUS:85136730550

SN - 0005-1098

VL - 145

JO - Automatica

JF - Automatica

M1 - 110534

ER -

Inverse linear quadratic dynamic games using partial state observations

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this