Inverse linear quadratic dynamic games using partial state observations

Chengpu Yu; Yao Li; Shukai Li; Jie Chen

doi:10.1016/j.automatica.2022.110534

Chengpu Yu^*, Yao Li, Shukai Li, Jie Chen

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › Peer-reviewed

4 Citations (Scopus)

Abstract

As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.

Original language	English
Article number	110534
Journal	Automatica
Volume	145
DOI	https://doi.org/10.1016/j.automatica.2022.110534
Publication status	Published - Nov 2022

Cite this

@article{cc45cccdf59e4f8a9c4413b06a35d555,
title = "Inverse linear quadratic dynamic games using partial state observations",
abstract = "As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.",
keywords = "Causal-and-anticausal models, Data-driven identification, Two-player LQ games",
author = "Chengpu Yu and Yao Li and Shukai Li and Jie Chen",
note = "Publisher Copyright: {\textcopyright} 2022 Elsevier Ltd",
year = "2022",
month = nov,
doi = "10.1016/j.automatica.2022.110534",
language = "English",
volume = "145",
journal = "Automatica",
issn = "0005-1098",
publisher = "Elsevier Ltd.",
}

TY - JOUR

T1 - Inverse linear quadratic dynamic games using partial state observations

AU - Yu, Chengpu

AU - Li, Yao

AU - Li, Shukai

AU - Chen, Jie

PY - 2022/11

Y1 - 2022/11

N2 - As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.

AB - As an extension of the inverse optimal control, the inverse linear quadratic (LQ) two-player dynamic game is studied in this paper. The considered inverse problem is to infer the cost function of one player using partial state observations as well as the control inputs of the other player. An identification framework is designed by firstly decoupling the causal and anticausal parts of the associated Hamilton–Jacobi–Bellman (HJB) equation and then identifying the coefficient matrices in the cost function. The twofold features of the presented method include: (i) the data-driven identification approach provides an easy-to-implement solution which avoids the direct optimization of a non-convex inverse problem as well as complicated algebraic manipulations on Riccati equations; (ii) the identification framework does not rely on the initial states or terminal costates, which enables its implementation using only segments of data trajectories. The effectiveness of the proposed method is demonstrated by simulation examples.

KW - Causal-and-anticausal models

KW - Data-driven identification

KW - Two-player LQ games

UR - http://www.scopus.com/inward/record.url?scp=85136730550&partnerID=8YFLogxK

U2 - 10.1016/j.automatica.2022.110534

DO - 10.1016/j.automatica.2022.110534

M3 - Article

AN - SCOPUS:85136730550

SN - 0005-1098

VL - 145

JO - Automatica

JF - Automatica

M1 - 110534

ER -
