TY - JOUR
T1 - A Q-learning fuzzy inference system based online energy management strategy for off-road hybrid electric vehicles
AU - Bo, Lin
AU - Han, Lijin
AU - Xiang, Changle
AU - Liu, Hui
AU - Ma, Tian
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/8/1
Y1 - 2022/8/1
N2 - In this paper, a Q-learning fuzzy inference system (QLFIS)-based online control architecture is proposed and applied for the optimal control of off-road hybrid electric vehicles (HEVs) to achieve better dynamic performance, fuel economy and real-time performance. A dynamic model, including a hybrid system, vehicle dynamics and road model, is established to obtain the state feedback according to the current driving environment under command. The optimal control strategy and objective function are both constructed by an adaptive network fuzzy inference system (ANFIS) due to its strong approaching ability. The fuzzy rules and parameters are trained online through the Q-learning algorithm and gradient descent method. This control framework provides a new control idea for the control of off-road vehicles. Without knowing the driving cycle in advance, it achieves a good control effect for different driving environments through online data collection and training. The QLFIS-based control strategy is compared to dynamic programming (DP)-based and rule-based strategies based on two different off-road driving cycles through simulation. The simulation results show that the vehicle dynamic performance and fuel economy are improved with respect to the rule-based strategy, while the calculation time is greatly reduced compared to that of the DP-based strategy.
AB - In this paper, a Q-learning fuzzy inference system (QLFIS)-based online control architecture is proposed and applied for the optimal control of off-road hybrid electric vehicles (HEVs) to achieve better dynamic performance, fuel economy and real-time performance. A dynamic model, including a hybrid system, vehicle dynamics and road model, is established to obtain the state feedback according to the current driving environment under command. The optimal control strategy and objective function are both constructed by an adaptive network fuzzy inference system (ANFIS) due to its strong approaching ability. The fuzzy rules and parameters are trained online through the Q-learning algorithm and gradient descent method. This control framework provides a new control idea for the control of off-road vehicles. Without knowing the driving cycle in advance, it achieves a good control effect for different driving environments through online data collection and training. The QLFIS-based control strategy is compared to dynamic programming (DP)-based and rule-based strategies based on two different off-road driving cycles through simulation. The simulation results show that the vehicle dynamic performance and fuel economy are improved with respect to the rule-based strategy, while the calculation time is greatly reduced compared to that of the DP-based strategy.
KW - Energy management strategy
KW - HEV
KW - Off-road vehicle
KW - Q-learning fuzzy inference system (QLFIS)
UR - http://www.scopus.com/inward/record.url?scp=85129024563&partnerID=8YFLogxK
U2 - 10.1016/j.energy.2022.123976
DO - 10.1016/j.energy.2022.123976
M3 - Article
AN - SCOPUS:85129024563
SN - 0360-5442
VL - 252
JO - Energy
JF - Energy
M1 - 123976
ER -