TY - GEN
T1 - Reinforcement Learning Improved Control Architecture for Following Vehicle in Platooning Driving
AU - Lu, Xiaoran
AU - Zou, Yuan
AU - Zhang, Xudong
AU - Liu, Haitao
AU - Chen, Yijie
AU - Zhang, Bin
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - To improve the mobility and range of Platooning Driving for material transportation or regional inspection tasks in off-road environments, this paper proposed an reinforcement learning improved architecture for the speed and energy control for the following vehicle, the primary task of the architecture is to fulfill the stable and high maneuverability requirement of the following vehicle, the secondary task is to enhance the fuel economic efficiency of the engine- generator sat.For the speed control system, the Deep Deterministic Policy Gradient (DDPG) algorithm is used to control the speed which is critical for the following task. For the energy management system(EMS), a n-step improved Twin Delayed Deep Deterministic Policy Gradient (TD3-Nstep) algorithm is built to control the engine-generator set for the power supply purpose, instead of power following control method, the EMS aims at stabilizing the bus voltage of the vehicle, which can support the maneuvering characteristics of the platooning driving. Simulation experiments show that the proposed algorithm enables the platooning vehicle to have good performance in high-mobility characteristics and energy system stability.
AB - To improve the mobility and range of Platooning Driving for material transportation or regional inspection tasks in off-road environments, this paper proposed an reinforcement learning improved architecture for the speed and energy control for the following vehicle, the primary task of the architecture is to fulfill the stable and high maneuverability requirement of the following vehicle, the secondary task is to enhance the fuel economic efficiency of the engine- generator sat.For the speed control system, the Deep Deterministic Policy Gradient (DDPG) algorithm is used to control the speed which is critical for the following task. For the energy management system(EMS), a n-step improved Twin Delayed Deep Deterministic Policy Gradient (TD3-Nstep) algorithm is built to control the engine-generator set for the power supply purpose, instead of power following control method, the EMS aims at stabilizing the bus voltage of the vehicle, which can support the maneuvering characteristics of the platooning driving. Simulation experiments show that the proposed algorithm enables the platooning vehicle to have good performance in high-mobility characteristics and energy system stability.
KW - energy management system
KW - following vehicle
KW - platooning driving
KW - reinforcement learning
KW - speed and energy control
UR - https://www.scopus.com/pages/publications/105028087549
U2 - 10.1007/978-981-95-4875-0_4
DO - 10.1007/978-981-95-4875-0_4
M3 - Conference contribution
AN - SCOPUS:105028087549
SN - 9789819548743
T3 - Communications in Computer and Information Science
SP - 38
EP - 50
BT - Intelligent Vehicles - 3rd CCF Intelligent Vehicles Symposium, CIVS 2025, Revised Selected Papers
A2 - Li, Huiyun
A2 - Wang, Zhongli
A2 - Zhao, Shuai
A2 - Sun, Peng
A2 - Herrmann, Michael
A2 - Zheng, Xi
A2 - Liu, Yuling
PB - Springer Science and Business Media Deutschland GmbH
T2 - 3rd CCF Intelligent Vehicles Symposium, CIVS 2025
Y2 - 16 August 2025 through 18 August 2025
ER -