TY - JOUR
T1 - Online Updating Energy Management Strategy Based on Deep Reinforcement Learning with Accelerated Training for Hybrid Electric Tracked Vehicles
AU - Zhang, Bin
AU - Zou, Yuan
AU - Zhang, Xudong
AU - Du, Guodong
AU - Jiao, Feixiang
AU - Guo, Ningyuan
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - An online updating energy management strategy (EMS) based on deep reinforcement learning (DRL) with accelerated training is proposed to further reduce fuel consumption and improve the adaptability of the algorithm. The online frame continuously updates neural network parameters every predetermined time. First, the mathematical model of a series hybrid electric tracked vehicle (SHETV) is established, and the accuracy of the model is verified by collecting data from the real vehicle. Second, an intelligent EMS is developed by combining deep deterministic policy gradient (DDPG) and prioritized experience replay (PER). DDPG can improve the control effect and accelerate the training process by eliminating the discretization of variables. The addition of PER can further improve fuel economy and SOC performance and shorten the training time. Third, an online updating framework based on DDPG-PER is proposed to make EMS better adapt to complex driving conditions. Finally, a software-in-the-loop simulation is built, and the effectiveness of the proposed online updating EMS is verified by comparison with other state-of-the-art algorithms. Simulation results show that the training process of the off-line EMS is greatly accelerated, which provides a potential for online application. Meanwhile, the fuel economy of the online updating EMS reached 93.9% of the benchmark DP.
AB - An online updating energy management strategy (EMS) based on deep reinforcement learning (DRL) with accelerated training is proposed to further reduce fuel consumption and improve the adaptability of the algorithm. The online frame continuously updates neural network parameters every predetermined time. First, the mathematical model of a series hybrid electric tracked vehicle (SHETV) is established, and the accuracy of the model is verified by collecting data from the real vehicle. Second, an intelligent EMS is developed by combining deep deterministic policy gradient (DDPG) and prioritized experience replay (PER). DDPG can improve the control effect and accelerate the training process by eliminating the discretization of variables. The addition of PER can further improve fuel economy and SOC performance and shorten the training time. Third, an online updating framework based on DDPG-PER is proposed to make EMS better adapt to complex driving conditions. Finally, a software-in-the-loop simulation is built, and the effectiveness of the proposed online updating EMS is verified by comparison with other state-of-the-art algorithms. Simulation results show that the training process of the off-line EMS is greatly accelerated, which provides a potential for online application. Meanwhile, the fuel economy of the online updating EMS reached 93.9% of the benchmark DP.
KW - Accelerated training
KW - deep deterministic policy gradient (DDPG)
KW - energy management strategy (EMS)
KW - online updating framework
KW - software-in-the-loop simulation
UR - https://www.scopus.com/pages/publications/85125729201
U2 - 10.1109/TTE.2022.3156590
DO - 10.1109/TTE.2022.3156590
M3 - Article
AN - SCOPUS:85125729201
SN - 2332-7782
VL - 8
SP - 3289
EP - 3306
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
IS - 3
ER -