Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

Xuemei Chen; Jiahe Liu; Zijia Wang; Xintong Han; Yufan Sun; Xuelong Zheng

doi:10.15918/j.jbit1004-0579.2022.056

Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

Xuemei Chen, Jiahe Liu^*, Zijia Wang, Xintong Han, Yufan Sun, Xuelong Zheng

^*此作品的通讯作者

前沿交叉科学研究院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Behavioral decision-making at urban intersections is one of the primary difficulties currently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban intersections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algorithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersection scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the decision models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.

源语言	英语
页（从-至）	327-339
页数	13
期刊	Journal of Beijing Institute of Technology (English Edition)
卷	31
期	4
DOI	https://doi.org/10.15918/j.jbit1004-0579.2022.056
出版状态	已出版 - 8月 2022

访问文件

10.15918/j.jbit1004-0579.2022.056

其它文件与链接

链接到 Scopus 的出版物

引用此

Chen, X., Liu, J., Wang, Z., Han, X., Sun, Y., & Zheng, X. (2022). Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections. Journal of Beijing Institute of Technology (English Edition), 31(4), 327-339. https://doi.org/10.15918/j.jbit1004-0579.2022.056

@article{e8d5c0ffd8804df3b31b8da9e43bf697,

title = "Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections",

abstract = "Behavioral decision-making at urban intersections is one of the primary difficulties currently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban intersections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algorithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersection scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the decision models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.",

keywords = "decision-making, intelligent vehicles, meta learning, reinforcement learning, urban intersections",

author = "Xuemei Chen and Jiahe Liu and Zijia Wang and Xintong Han and Yufan Sun and Xuelong Zheng",

year = "2022",

month = aug,

doi = "10.15918/j.jbit1004-0579.2022.056",

language = "English",

volume = "31",

pages = "327--339",

journal = "Journal of Beijing Institute of Technology (English Edition)",

issn = "1004-0579",

publisher = "Beijing Institute of Technology",

number = "4",

}

TY - JOUR

T1 - Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

AU - Chen, Xuemei

AU - Liu, Jiahe

AU - Wang, Zijia

AU - Han, Xintong

AU - Sun, Yufan

AU - Zheng, Xuelong

PY - 2022/8

Y1 - 2022/8

N2 - Behavioral decision-making at urban intersections is one of the primary difficulties currently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban intersections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algorithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersection scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the decision models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.

AB - Behavioral decision-making at urban intersections is one of the primary difficulties currently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban intersections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algorithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersection scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the decision models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.

KW - decision-making

KW - intelligent vehicles

KW - meta learning

KW - reinforcement learning

KW - urban intersections

UR - http://www.scopus.com/inward/record.url?scp=85138473559&partnerID=8YFLogxK

U2 - 10.15918/j.jbit1004-0579.2022.056

DO - 10.15918/j.jbit1004-0579.2022.056

M3 - Article

AN - SCOPUS:85138473559

SN - 1004-0579

VL - 31

SP - 327

EP - 339

JO - Journal of Beijing Institute of Technology (English Edition)

JF - Journal of Beijing Institute of Technology (English Edition)

IS - 4

ER -

Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

摘要

访问文件

其它文件与链接

指纹

引用此