TY - JOUR
T1 - Implementation of real-time energy management strategy based on reinforcement learning for hybrid electric vehicles and simulation validation
AU - Kong, Zehui
AU - Zou, Yuan
AU - Liu, Teng
N1 - Publisher Copyright:
© 2017 Kong et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2017/7
Y1 - 2017/7
N2 - To further improve the fuel economy of series hybrid electric tracked vehicles, a reinforcement learning (RL)-based real-time energy management strategy is developed in this paper. In order to utilize the statistical characteristics of online driving schedule effectively, a recursive algorithm for the transition probability matrix (TPM) of power-request is derived. The reinforcement learning (RL) is applied to calculate and update the control policy at regular time, adapting to the varying driving conditions. A facing-forward powertrain model is built in detail, including the engine-generator model, battery model and vehicle dynamical model. The robustness and adaptability of real-time energy management strategy are validated through the comparison with the stationary control strategy based on initial transition probability matrix (TPM) generated from a long naturalistic driving cycle in the simulation. Results indicate that proposed method has better fuel economy than stationary one and is more effective in real-time control.
AB - To further improve the fuel economy of series hybrid electric tracked vehicles, a reinforcement learning (RL)-based real-time energy management strategy is developed in this paper. In order to utilize the statistical characteristics of online driving schedule effectively, a recursive algorithm for the transition probability matrix (TPM) of power-request is derived. The reinforcement learning (RL) is applied to calculate and update the control policy at regular time, adapting to the varying driving conditions. A facing-forward powertrain model is built in detail, including the engine-generator model, battery model and vehicle dynamical model. The robustness and adaptability of real-time energy management strategy are validated through the comparison with the stationary control strategy based on initial transition probability matrix (TPM) generated from a long naturalistic driving cycle in the simulation. Results indicate that proposed method has better fuel economy than stationary one and is more effective in real-time control.
UR - http://www.scopus.com/inward/record.url?scp=85021650158&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0180491
DO - 10.1371/journal.pone.0180491
M3 - Article
C2 - 28671967
AN - SCOPUS:85021650158
SN - 1932-6203
VL - 12
JO - PLoS ONE
JF - PLoS ONE
IS - 7
M1 - e0180491
ER -