Personalized Lane Change Planning and Control by Imitation Learning from Drivers

Hanqing Tian; Chao Wei; Chaoyang Jiang; Zirui Li; Jibin Hu

doi:10.1109/TIE.2022.3177788

Personalized Lane Change Planning and Control by Imitation Learning from Drivers

Hanqing Tian, Chao Wei^*, Chaoyang Jiang, Zirui Li, Jibin Hu

^*此作品的通讯作者

机械与车辆学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

12 引用（Scopus）

摘要

In this article, we propose a novel personalized planning and control approach for lane change assistance system which can efficiently learn a model prediction control (MPC)-based driver-specific lane-changing policy via end-to-end imitation learning from a few driver demonstrations. Specifically, we build a novel learnable predictive model of the vehicle-driver system and design an adaptable cost function for the MPC-based lane change controller. We then calculate the gradient of the imitation loss with respect to the personalization parameters of the model and cost function via differentiating the optimality conditions, and update those parameters to minimize the imitation loss in an end-to-end fashion. A semi-physical simulation on a driving simulator and a closed-loop test on a real vehicle are conducted to validate the learning ability and personalized control performance. The results show that 1) the proposed method can automatically implement both the generalized and the personalized lane change planning and control by learning from demonstration data; 2) the proposed controller can adapt to different driver-specific behaviors; and 3) the proposed approach outperforms the model-free learning approach in terms of imitation accuracy, interpretability, data efficiency, and generalized performance.

源语言	英语
页（从-至）	3995-4006
页数	12
期刊	IEEE Transactions on Industrial Electronics
卷	70
期	4
DOI	https://doi.org/10.1109/TIE.2022.3177788
出版状态	已出版 - 1 4月 2023

访问文件

10.1109/TIE.2022.3177788

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{063ff4bc998f45dbb1be4d656defbce0,

title = "Personalized Lane Change Planning and Control by Imitation Learning from Drivers",

abstract = "In this article, we propose a novel personalized planning and control approach for lane change assistance system which can efficiently learn a model prediction control (MPC)-based driver-specific lane-changing policy via end-to-end imitation learning from a few driver demonstrations. Specifically, we build a novel learnable predictive model of the vehicle-driver system and design an adaptable cost function for the MPC-based lane change controller. We then calculate the gradient of the imitation loss with respect to the personalization parameters of the model and cost function via differentiating the optimality conditions, and update those parameters to minimize the imitation loss in an end-to-end fashion. A semi-physical simulation on a driving simulator and a closed-loop test on a real vehicle are conducted to validate the learning ability and personalized control performance. The results show that 1) the proposed method can automatically implement both the generalized and the personalized lane change planning and control by learning from demonstration data; 2) the proposed controller can adapt to different driver-specific behaviors; and 3) the proposed approach outperforms the model-free learning approach in terms of imitation accuracy, interpretability, data efficiency, and generalized performance.",

keywords = "Differentiable optimization, imitation learning, lane change, model predictive control (MPC), personalized driver assistance system",

author = "Hanqing Tian and Chao Wei and Chaoyang Jiang and Zirui Li and Jibin Hu",

note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",

year = "2023",

month = apr,

day = "1",

doi = "10.1109/TIE.2022.3177788",

language = "English",

volume = "70",

pages = "3995--4006",

journal = "IEEE Transactions on Industrial Electronics",

issn = "0278-0046",

publisher = "IEEE Industrial Electronics Society",

number = "4",

}

TY - JOUR

T1 - Personalized Lane Change Planning and Control by Imitation Learning from Drivers

AU - Tian, Hanqing

AU - Wei, Chao

AU - Jiang, Chaoyang

AU - Li, Zirui

AU - Hu, Jibin

PY - 2023/4/1

Y1 - 2023/4/1

N2 - In this article, we propose a novel personalized planning and control approach for lane change assistance system which can efficiently learn a model prediction control (MPC)-based driver-specific lane-changing policy via end-to-end imitation learning from a few driver demonstrations. Specifically, we build a novel learnable predictive model of the vehicle-driver system and design an adaptable cost function for the MPC-based lane change controller. We then calculate the gradient of the imitation loss with respect to the personalization parameters of the model and cost function via differentiating the optimality conditions, and update those parameters to minimize the imitation loss in an end-to-end fashion. A semi-physical simulation on a driving simulator and a closed-loop test on a real vehicle are conducted to validate the learning ability and personalized control performance. The results show that 1) the proposed method can automatically implement both the generalized and the personalized lane change planning and control by learning from demonstration data; 2) the proposed controller can adapt to different driver-specific behaviors; and 3) the proposed approach outperforms the model-free learning approach in terms of imitation accuracy, interpretability, data efficiency, and generalized performance.

AB - In this article, we propose a novel personalized planning and control approach for lane change assistance system which can efficiently learn a model prediction control (MPC)-based driver-specific lane-changing policy via end-to-end imitation learning from a few driver demonstrations. Specifically, we build a novel learnable predictive model of the vehicle-driver system and design an adaptable cost function for the MPC-based lane change controller. We then calculate the gradient of the imitation loss with respect to the personalization parameters of the model and cost function via differentiating the optimality conditions, and update those parameters to minimize the imitation loss in an end-to-end fashion. A semi-physical simulation on a driving simulator and a closed-loop test on a real vehicle are conducted to validate the learning ability and personalized control performance. The results show that 1) the proposed method can automatically implement both the generalized and the personalized lane change planning and control by learning from demonstration data; 2) the proposed controller can adapt to different driver-specific behaviors; and 3) the proposed approach outperforms the model-free learning approach in terms of imitation accuracy, interpretability, data efficiency, and generalized performance.

KW - Differentiable optimization

KW - imitation learning

KW - lane change

KW - model predictive control (MPC)

KW - personalized driver assistance system

UR - http://www.scopus.com/inward/record.url?scp=85131763831&partnerID=8YFLogxK

U2 - 10.1109/TIE.2022.3177788

DO - 10.1109/TIE.2022.3177788

M3 - Article

AN - SCOPUS:85131763831

SN - 0278-0046

VL - 70

SP - 3995

EP - 4006

JO - IEEE Transactions on Industrial Electronics

JF - IEEE Transactions on Industrial Electronics

IS - 4

ER -

Personalized Lane Change Planning and Control by Imitation Learning from Drivers

摘要

访问文件

其它文件与链接

指纹

引用此