Inverse Model Predictive Control: Learning Optimal Control Cost Functions for MPC

Fawang Zhang, Jingliang Duan*, Haoyuan Xu, Hao Chen, Hui Liu, Shida Nie, Shengbo Eben Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Inverse optimal control (IOC) seeks to infer a control cost function that captures the underlying goals and preferences of expert demonstrations. While significant progress has been made in finite-horizon IOC, which focuses on learning control cost functions based on rollout trajectories rather than actual trajectories, the application of IOC to receding horizon control, also known as model predictive control (MPC), has been overlooked. MPC is more prevalent in practical settings and poses additional challenges for IOC learning since it is complicated to calculate the gradient of actual trajectories with respect to cost parameters. In light of this, we propose the inverse MPC (IMPC) method to identify the optimal cost function that effectively minimizes the discrepancy between the actual trajectory and its associated demonstration. To compute the gradient of actual trajectories with respect to cost parameters, we first establish two differential Pontryagin's maximum principle (PMP) conditions by differentiating the traditional PMP conditions with respect to cost parameters and initial states, respectively. We then formulate two auxiliary optimal control problems based on the derived differentiated PMP conditions, whose solutions can be directly used to determine the gradient for updating cost parameters. We validate the efficacy of the proposed method through experiments involving five simulation tasks and two real-world mobile robot control tasks. The results consistently demonstrate that IMPC outperforms existing finite-horizon IOC methods across all experiments.

Original languageEnglish
Pages (from-to)13644-13655
Number of pages12
JournalIEEE Transactions on Industrial Informatics
Volume20
Issue number12
DOIs
Publication statusPublished - 2024

Keywords

  • Bilevel optimization
  • imitation learning
  • inverse model predictive control (IMPC)

Fingerprint

Dive into the research topics of 'Inverse Model Predictive Control: Learning Optimal Control Cost Functions for MPC'. Together they form a unique fingerprint.

Cite this