TY - GEN
T1 - Reinforcement Learning Energy Management for Hybrid Electric Tracked Vehicle with Deep Deterministic Policy Gradient
AU - Zhang, Bin
AU - Wu, Jinlong
AU - Zou, Yuan
AU - Zhang, Xudong
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - Reinforcement learning (RL) has been applied to energy management of hybrid electric vehicles to synthesize the system efficiency and adaptability. However, the existing RL-based energy management strategies still suffer the “curse of dimensionality” due to the discretization of the state and control action variables. To cure this disadvantage, a continuous RL-based energy management adopting deep deterministic policy gradient (DDPG) is proposed and applied to a series hybrid electric tracked vehicle. First, DDPG-based energy management strategy is put forward, where two sets of neural networks are adopted to parameterize strategy and approximate the action-value function respectively to eliminate the discretization. In addition, an online updating framework of energy management is carried out to increase the adaptability of the energy management strategy. The simulation results show that the fuel consumption of the online updating strategy is 5.9% lower than that of the stationary strategy, and is close to that of dynamic programming benchmark strategy. Besides, the computational burden is significantly reduced and can be implemented in real-time.
AB - Reinforcement learning (RL) has been applied to energy management of hybrid electric vehicles to synthesize the system efficiency and adaptability. However, the existing RL-based energy management strategies still suffer the “curse of dimensionality” due to the discretization of the state and control action variables. To cure this disadvantage, a continuous RL-based energy management adopting deep deterministic policy gradient (DDPG) is proposed and applied to a series hybrid electric tracked vehicle. First, DDPG-based energy management strategy is put forward, where two sets of neural networks are adopted to parameterize strategy and approximate the action-value function respectively to eliminate the discretization. In addition, an online updating framework of energy management is carried out to increase the adaptability of the energy management strategy. The simulation results show that the fuel consumption of the online updating strategy is 5.9% lower than that of the stationary strategy, and is close to that of dynamic programming benchmark strategy. Besides, the computational burden is significantly reduced and can be implemented in real-time.
KW - Deep reinforcement learning
KW - Energy management
KW - Hybrid electric tracked vehicle
UR - http://www.scopus.com/inward/record.url?scp=85124001816&partnerID=8YFLogxK
U2 - 10.1007/978-981-16-2090-4_53
DO - 10.1007/978-981-16-2090-4_53
M3 - Conference contribution
AN - SCOPUS:85124001816
SN - 9789811620898
T3 - Lecture Notes in Electrical Engineering
SP - 879
EP - 893
BT - Proceedings of China SAE Congress 2020
PB - Springer Science and Business Media Deutschland GmbH
T2 - China SAE Congress, 2020
Y2 - 27 October 2020 through 29 October 2020
ER -