Curiosity-driven reinforcement learning with graph transformers for decision-making in connected and autonomous vehicles

Qi Liu, Yujie Tang, Xueyuan Li*, Kaifeng Wang, Fan Yang, Zirui Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Cooperative decision-making technology for connected and autonomous vehicles (CAVs) in mixed autonomy traffic is critical for the advancement of modern intelligent transportation systems. Recently, graph reinforcement learning (GRL) approaches have shown remarkable success in addressing decision-making challenges by leveraging graph-based technologies. However, existing GRL-based research faces substantial challenges in generating accurate feature embeddings to enhance driving policies, thoroughly exploring the driving environment, and efficiently training models. To address these challenges, this paper proposes a graph transformer reinforcement learning method with a distributional curiosity mechanism to improve the feature generation efficiency and environment exploration, ultimately boosting the decision-making performance of CAVs. First, an improved transformed graph convolutional network (ITransGCN) is proposed, integrating graph convolutional network (GCN), rotary position encoding method (ROPE), and temporal prior attention mechanism to strengthen sequential modeling capabilities, thereby generating informative spatial–temporal feature embeddings. Then, a curiosity mechanism based on distributional random network distillation (DRND) is proposed to enhance the exploratory capabilities of CAVs in driving environments. Additionally, a temporal integrated deep reinforcement learning (TI-DRL) model is developed, incorporating an auxiliary loss that integrates spatial–temporal information to improve the model's ability to capture the spatial–temporal dependencies. Finally, a cooperation-aware reward function is constructed to further evaluate the performance of CAVs. Comprehensive experiments are conducted across three representative traffic scenarios to validate the proposed method. The results demonstrate that our proposed method outperforms the baselines in driving safety, efficiency, and model stability, highlighting the effectiveness of the core components and the generalization capability of the proposed method.

Original languageEnglish
Article number105183
JournalTransportation Research Part C: Emerging Technologies
Volume177
DOIs
Publication statusPublished - Aug 2025
Externally publishedYes

Keywords

  • Connected and autonomous vehicles
  • Curiosity mechanism
  • Decision-making
  • Graph reinforcement learning
  • Transformer

Fingerprint

Dive into the research topics of 'Curiosity-driven reinforcement learning with graph transformers for decision-making in connected and autonomous vehicles'. Together they form a unique fingerprint.

Cite this