TY - JOUR
T1 - Real-Time Adaptive Energy Management for Hybrid Electric Vehicles Based on Monte Carlo Tree Search
AU - Yang, Ningkang
AU - Han, Lijin
AU - Ruan, Shumin
AU - Zhang, Wannian
AU - Liu, Hui
AU - Xiang, Changle
N1 - Publisher Copyright:
IEEE
PY - 2023
Y1 - 2023
N2 - The performance of energy management strategies (EMSs) for hybrid electric vehicles (HEVs) is greatly impacted by the vehicles’ driving conditions. To handle the energy management problem in various unknown driving conditions, a real-time adaptive EMS is developed through a novel model-based reinforcement learning (RL) algorithm—Monte Carlo tree search (MCTS). First, a RL model is constructed to represent the HEV, which consists of a deterministic powertrain approximation model and a stochastic recursive Markov Chain. During online implementation, the model is continuously updated according to new conditions, guaranteeing its accuracy. Then, the MCTS algorithm is detailed. Different from traditional RL algorithms which learn a complete policy for a driving cycle, MCTS seeks to search the optimal action in real-time for each encountered HEV state. Combining the dynamic RL model and the real-time MCTS algorithm, the EMS can maintain satisfactory performance in various driving conditions with no prior information. In the simulation, the proposed strategy consumes 8.03%, 5.20%, and 4.34% less fuel compared with model-free Q-learning, model-based Q-learning, and model predictive control in a totally unknown cycle, and maintains the similar performance in other four cycles, which demonstrates its superior adaptability. Furthermore, experiments in a test bench also validate its effectiveness.
AB - The performance of energy management strategies (EMSs) for hybrid electric vehicles (HEVs) is greatly impacted by the vehicles’ driving conditions. To handle the energy management problem in various unknown driving conditions, a real-time adaptive EMS is developed through a novel model-based reinforcement learning (RL) algorithm—Monte Carlo tree search (MCTS). First, a RL model is constructed to represent the HEV, which consists of a deterministic powertrain approximation model and a stochastic recursive Markov Chain. During online implementation, the model is continuously updated according to new conditions, guaranteeing its accuracy. Then, the MCTS algorithm is detailed. Different from traditional RL algorithms which learn a complete policy for a driving cycle, MCTS seeks to search the optimal action in real-time for each encountered HEV state. Combining the dynamic RL model and the real-time MCTS algorithm, the EMS can maintain satisfactory performance in various driving conditions with no prior information. In the simulation, the proposed strategy consumes 8.03%, 5.20%, and 4.34% less fuel compared with model-free Q-learning, model-based Q-learning, and model predictive control in a totally unknown cycle, and maintains the similar performance in other four cycles, which demonstrates its superior adaptability. Furthermore, experiments in a test bench also validate its effectiveness.
KW - Adaptation models
KW - Computational modeling
KW - Energy management
KW - Fuels
KW - Hybrid electric vehicles
KW - Monte Carlo tree search
KW - Q-learning
KW - Real-time systems
KW - hybrid electric vehicle
KW - real-time energy management
KW - reinforcement learning
UR - https://www.scopus.com/pages/publications/85168271719
U2 - 10.1109/TTE.2023.3305520
DO - 10.1109/TTE.2023.3305520
M3 - Article
AN - SCOPUS:85168271719
SN - 2332-7782
SP - 1
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
ER -