跳到主要导航 跳到搜索 跳到主要内容

Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning

  • Lijin Han
  • , Ke Yang
  • , Tian Ma
  • , Ningkang Yang*
  • , Hui Liu
  • , Lingxiong Guo
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • China North Vehicle Research Institute

科研成果: 期刊稿件文章同行评审

摘要

Hybrid electric vehicles (HEVs) bridge the gap between internal combustion engine vehicles and pure electric vehicles, and are therefore regarded as a promising solution to the energy crisis. This paper proposes a real-time energy management strategy (EMS) for hybrid electric vehicles based on reinforcement learning (RL) to improve fuel economy and minimize battery degradation. First, an online recursive Markov chain (MC) is developed that continuously collects statistical features from actual driving conditions, and thus an adaptive and accurate environment model is established. Then, a novel RL algorithm, eligibility trace, is introduced to learn the control policy online based on MC model. By introducing a trace-decay parameter, the eligibility trace algorithm unifies the returns of different steps, forming a more reliable estimate of the optimal value function, and therefore outperforms traditional RL algorithms in optimization. Furthermore, induced matrix norm (IMN) is employed as a standard to measure difference between transition probability matrices (TPM) of MC and to decide when to update environment model as well as recalculate the control policy. Therefore, the EMS's adaptability to various driving conditions are significantly enhanced. Simulation results indicate that eligibility trace shows the best performance in both improving fuel economy and reducing battery life loss compared with Q-learning and rule-based method.

源语言英语
文章编号124986
期刊Energy
259
DOI
出版状态已出版 - 15 11月 2022

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 7 - 经济适用的清洁能源
    可持续发展目标 7 经济适用的清洁能源

指纹

探究 'Battery life constrained real-time energy management strategy for hybrid electric vehicles based on reinforcement learning' 的科研主题。它们共同构成独一无二的指纹。

引用此