Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning

Lijin Han; Ke Yang; Tian Ma; Ningkang Yang; Hui Liu; Lingxiong Guo

doi:10.1016/j.energy.2022.124986

Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning

Lijin Han, Ke Yang, Tian Ma, Ningkang Yang^*, Hui Liu, Lingxiong Guo

^*Corresponding author for this work

School of Mechanical Engineering

Research output: Contribution to journal › Article › peer-review

24 Citations (Scopus)

Abstract

Hybrid electric vehicles (HEVs) bridge the gap between internal combustion engine vehicles and pure electric vehicles, and are therefore regarded as a promising solution to the energy crisis. This paper proposes a real-time energy management strategy (EMS) for hybrid electric vehicles based on reinforcement learning (RL) to improve fuel economy and minimize battery degradation. First, an online recursive Markov chain (MC) is developed that continuously collects statistical features from actual driving conditions, and thus an adaptive and accurate environment model is established. Then, a novel RL algorithm, eligibility trace, is introduced to learn the control policy online based on MC model. By introducing a trace-decay parameter, the eligibility trace algorithm unifies the returns of different steps, forming a more reliable estimate of the optimal value function, and therefore outperforms traditional RL algorithms in optimization. Furthermore, induced matrix norm (IMN) is employed as a standard to measure difference between transition probability matrices (TPM) of MC and to decide when to update environment model as well as recalculate the control policy. Therefore, the EMS's adaptability to various driving conditions are significantly enhanced. Simulation results indicate that eligibility trace shows the best performance in both improving fuel economy and reducing battery life loss compared with Q-learning and rule-based method.

Original language	English
Article number	124986
Journal	Energy
Volume	259
DOIs	https://doi.org/10.1016/j.energy.2022.124986
Publication status	Published - 15 Nov 2022

Keywords

Battery life
Eligibility trace
Hybrid electric vehicle
Real-time energy management strategy
Reinforcement learning

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1016/j.energy.2022.124986

Cite this

Han, L., Yang, K., Ma, T., Yang, N., Liu, H., & Guo, L. (2022). Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning. Energy, 259, Article 124986. https://doi.org/10.1016/j.energy.2022.124986

@article{64c789738a314554acf856b423f3bd83,

title = "Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning",

abstract = "Hybrid electric vehicles (HEVs) bridge the gap between internal combustion engine vehicles and pure electric vehicles, and are therefore regarded as a promising solution to the energy crisis. This paper proposes a real-time energy management strategy (EMS) for hybrid electric vehicles based on reinforcement learning (RL) to improve fuel economy and minimize battery degradation. First, an online recursive Markov chain (MC) is developed that continuously collects statistical features from actual driving conditions, and thus an adaptive and accurate environment model is established. Then, a novel RL algorithm, eligibility trace, is introduced to learn the control policy online based on MC model. By introducing a trace-decay parameter, the eligibility trace algorithm unifies the returns of different steps, forming a more reliable estimate of the optimal value function, and therefore outperforms traditional RL algorithms in optimization. Furthermore, induced matrix norm (IMN) is employed as a standard to measure difference between transition probability matrices (TPM) of MC and to decide when to update environment model as well as recalculate the control policy. Therefore, the EMS's adaptability to various driving conditions are significantly enhanced. Simulation results indicate that eligibility trace shows the best performance in both improving fuel economy and reducing battery life loss compared with Q-learning and rule-based method.",

keywords = "Battery life, Eligibility trace, Hybrid electric vehicle, Real-time energy management strategy, Reinforcement learning",

author = "Lijin Han and Ke Yang and Tian Ma and Ningkang Yang and Hui Liu and Lingxiong Guo",

note = "Publisher Copyright: {\textcopyright} 2022",

year = "2022",

month = nov,

day = "15",

doi = "10.1016/j.energy.2022.124986",

language = "English",

volume = "259",

journal = "Energy",

issn = "0360-5442",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning

AU - Han, Lijin

AU - Yang, Ke

AU - Ma, Tian

AU - Yang, Ningkang

AU - Liu, Hui

AU - Guo, Lingxiong

PY - 2022/11/15

Y1 - 2022/11/15

N2 - Hybrid electric vehicles (HEVs) bridge the gap between internal combustion engine vehicles and pure electric vehicles, and are therefore regarded as a promising solution to the energy crisis. This paper proposes a real-time energy management strategy (EMS) for hybrid electric vehicles based on reinforcement learning (RL) to improve fuel economy and minimize battery degradation. First, an online recursive Markov chain (MC) is developed that continuously collects statistical features from actual driving conditions, and thus an adaptive and accurate environment model is established. Then, a novel RL algorithm, eligibility trace, is introduced to learn the control policy online based on MC model. By introducing a trace-decay parameter, the eligibility trace algorithm unifies the returns of different steps, forming a more reliable estimate of the optimal value function, and therefore outperforms traditional RL algorithms in optimization. Furthermore, induced matrix norm (IMN) is employed as a standard to measure difference between transition probability matrices (TPM) of MC and to decide when to update environment model as well as recalculate the control policy. Therefore, the EMS's adaptability to various driving conditions are significantly enhanced. Simulation results indicate that eligibility trace shows the best performance in both improving fuel economy and reducing battery life loss compared with Q-learning and rule-based method.

AB - Hybrid electric vehicles (HEVs) bridge the gap between internal combustion engine vehicles and pure electric vehicles, and are therefore regarded as a promising solution to the energy crisis. This paper proposes a real-time energy management strategy (EMS) for hybrid electric vehicles based on reinforcement learning (RL) to improve fuel economy and minimize battery degradation. First, an online recursive Markov chain (MC) is developed that continuously collects statistical features from actual driving conditions, and thus an adaptive and accurate environment model is established. Then, a novel RL algorithm, eligibility trace, is introduced to learn the control policy online based on MC model. By introducing a trace-decay parameter, the eligibility trace algorithm unifies the returns of different steps, forming a more reliable estimate of the optimal value function, and therefore outperforms traditional RL algorithms in optimization. Furthermore, induced matrix norm (IMN) is employed as a standard to measure difference between transition probability matrices (TPM) of MC and to decide when to update environment model as well as recalculate the control policy. Therefore, the EMS's adaptability to various driving conditions are significantly enhanced. Simulation results indicate that eligibility trace shows the best performance in both improving fuel economy and reducing battery life loss compared with Q-learning and rule-based method.

KW - Battery life

KW - Eligibility trace

KW - Hybrid electric vehicle

KW - Real-time energy management strategy

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85135883800&partnerID=8YFLogxK

U2 - 10.1016/j.energy.2022.124986

DO - 10.1016/j.energy.2022.124986

M3 - Article

AN - SCOPUS:85135883800

SN - 0360-5442

VL - 259

JO - Energy

JF - Energy

M1 - 124986

ER -

Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this