Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

Xuemei Chen, Jiahe Liu*, Zijia Wang, Xintong Han, Yufan Sun, Xuelong Zheng

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Behavioral decision-making at urban intersections is one of the primary difficulties currently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban intersections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algorithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersection scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the decision models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.

Original languageEnglish
Pages (from-to)327-339
Number of pages13
JournalJournal of Beijing Institute of Technology (English Edition)
Volume31
Issue number4
DOIs
Publication statusPublished - Aug 2022

Keywords

  • decision-making
  • intelligent vehicles
  • meta learning
  • reinforcement learning
  • urban intersections

Fingerprint

Dive into the research topics of 'Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections'. Together they form a unique fingerprint.

Cite this