TY - JOUR
T1 - Personalized Lane Change Planning and Control by Imitation Learning from Drivers
AU - Tian, Hanqing
AU - Wei, Chao
AU - Jiang, Chaoyang
AU - Li, Zirui
AU - Hu, Jibin
N1 - Publisher Copyright:
© 1982-2012 IEEE.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - In this article, we propose a novel personalized planning and control approach for lane change assistance system which can efficiently learn a model prediction control (MPC)-based driver-specific lane-changing policy via end-to-end imitation learning from a few driver demonstrations. Specifically, we build a novel learnable predictive model of the vehicle-driver system and design an adaptable cost function for the MPC-based lane change controller. We then calculate the gradient of the imitation loss with respect to the personalization parameters of the model and cost function via differentiating the optimality conditions, and update those parameters to minimize the imitation loss in an end-to-end fashion. A semi-physical simulation on a driving simulator and a closed-loop test on a real vehicle are conducted to validate the learning ability and personalized control performance. The results show that 1) the proposed method can automatically implement both the generalized and the personalized lane change planning and control by learning from demonstration data; 2) the proposed controller can adapt to different driver-specific behaviors; and 3) the proposed approach outperforms the model-free learning approach in terms of imitation accuracy, interpretability, data efficiency, and generalized performance.
AB - In this article, we propose a novel personalized planning and control approach for lane change assistance system which can efficiently learn a model prediction control (MPC)-based driver-specific lane-changing policy via end-to-end imitation learning from a few driver demonstrations. Specifically, we build a novel learnable predictive model of the vehicle-driver system and design an adaptable cost function for the MPC-based lane change controller. We then calculate the gradient of the imitation loss with respect to the personalization parameters of the model and cost function via differentiating the optimality conditions, and update those parameters to minimize the imitation loss in an end-to-end fashion. A semi-physical simulation on a driving simulator and a closed-loop test on a real vehicle are conducted to validate the learning ability and personalized control performance. The results show that 1) the proposed method can automatically implement both the generalized and the personalized lane change planning and control by learning from demonstration data; 2) the proposed controller can adapt to different driver-specific behaviors; and 3) the proposed approach outperforms the model-free learning approach in terms of imitation accuracy, interpretability, data efficiency, and generalized performance.
KW - Differentiable optimization
KW - imitation learning
KW - lane change
KW - model predictive control (MPC)
KW - personalized driver assistance system
UR - http://www.scopus.com/inward/record.url?scp=85131763831&partnerID=8YFLogxK
U2 - 10.1109/TIE.2022.3177788
DO - 10.1109/TIE.2022.3177788
M3 - Article
AN - SCOPUS:85131763831
SN - 0278-0046
VL - 70
SP - 3995
EP - 4006
JO - IEEE Transactions on Industrial Electronics
JF - IEEE Transactions on Industrial Electronics
IS - 4
ER -