TY - JOUR
T1 - Interact, Instruct to Improve
T2 - A LLM-Driven Parallel Actor-Reasoner Framework for Enhancing Autonomous Vehicle Interactions
AU - Fang, Shiyu
AU - Liu, Jiaqi
AU - Xu, Chengkai
AU - Lv, Chen
AU - Hang, Peng
AU - Sun, Jian
N1 - Publisher Copyright:
© 2000-2011 IEEE.
PY - 2026
Y1 - 2026
N2 - Autonomous Vehicles (AVs) have entered the stage of commercialization, yet their performance in interactive scenarios remains unsatisfactory due to challenges such as decision interpretability, human driver (HV) heterogeneity, and scenario diversity. Recent advances in Large Language Models (LLMs) provide a promising avenue to enhance AV interaction capabilities, but their high computational demand hinders practical deployment. To address these challenges, this paper introduces a parallel Actor-Reasoner framework designed to enable explicit and real-time bidirectional AV-HV interactions. First, the Reasoner employs a localized LLM with CoT reasoning and human instructions to progressively infer HV intent, style, AV action, and eHMI displays during the training stage. During testing, it continues to infer he above information except AV action. The Actor, in turn, is constructed as an interaction memory through the Reasoner's interactions with heterogeneous simulated HVs across diverse scenarios, where the memory partition and two-layer retrieval modules are employed in the construction process. During testing, the Actor is used to retrieve feasible actions for the AV. Ablation studies across multiple scenarios demonstrate that the proposed modules improve interaction success rates by an average of 15% and 12%, respectively. Moreover, comparison studies in multi-vehicle scenarios further show that the proposed Actor-Reasoner framework achieves superior safety while simultaneously improving efficiency. Finally, with the integration of external Human-Machine Interface (eHMI) information derived from the Reasoner's reasoning and feasible actions retrieved from the Actor, the framework is validated in real-world field interactions. Our code is available at https://github.com/FanGShiYuu/Actor-Reasoner.
AB - Autonomous Vehicles (AVs) have entered the stage of commercialization, yet their performance in interactive scenarios remains unsatisfactory due to challenges such as decision interpretability, human driver (HV) heterogeneity, and scenario diversity. Recent advances in Large Language Models (LLMs) provide a promising avenue to enhance AV interaction capabilities, but their high computational demand hinders practical deployment. To address these challenges, this paper introduces a parallel Actor-Reasoner framework designed to enable explicit and real-time bidirectional AV-HV interactions. First, the Reasoner employs a localized LLM with CoT reasoning and human instructions to progressively infer HV intent, style, AV action, and eHMI displays during the training stage. During testing, it continues to infer he above information except AV action. The Actor, in turn, is constructed as an interaction memory through the Reasoner's interactions with heterogeneous simulated HVs across diverse scenarios, where the memory partition and two-layer retrieval modules are employed in the construction process. During testing, the Actor is used to retrieve feasible actions for the AV. Ablation studies across multiple scenarios demonstrate that the proposed modules improve interaction success rates by an average of 15% and 12%, respectively. Moreover, comparison studies in multi-vehicle scenarios further show that the proposed Actor-Reasoner framework achieves superior safety while simultaneously improving efficiency. Finally, with the integration of external Human-Machine Interface (eHMI) information derived from the Reasoner's reasoning and feasible actions retrieved from the Actor, the framework is validated in real-world field interactions. Our code is available at https://github.com/FanGShiYuu/Actor-Reasoner.
KW - Autonomous vehicles
KW - driving interaction
KW - external human-machine interface
KW - large language model
KW - memory retrieval
UR - https://www.scopus.com/pages/publications/105023090155
U2 - 10.1109/TITS.2025.3633264
DO - 10.1109/TITS.2025.3633264
M3 - Article
AN - SCOPUS:105023090155
SN - 1524-9050
VL - 27
SP - 2000
EP - 2012
JO - IEEE Transactions on Intelligent Transportation Systems
JF - IEEE Transactions on Intelligent Transportation Systems
IS - 2
ER -