TY - JOUR
T1 - Multiagent Reinforcement Learning for Ecological Car-Following Control in Mixed Traffic
AU - Wang, Qun
AU - Ju, Fei
AU - Wang, Huaiyu
AU - Qian, Yahui
AU - Zhu, Meixin
AU - Zhuang, Weichao
AU - Wang, Liangmo
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The push toward sustainable transportation emphasizes vehicular energy efficiency in mixed traffic scenarios. A research hotspot is the cooperative control of connected and automated vehicles (CAVs), particularly in contexts involving the uncertainties of human-driven vehicles (HDVs). Cooperative control strategies are pivotal in improving driving safety, traffic efficiency, and reducing energy consumption. Our study introduces a cooperative control strategy for CAVs in mixed traffic based on the multiagent twin delayed deep deterministic policy gradient (MATD3) algorithm. We use the intelligent driver model (IDM) to calibrate and model human driving behaviors with 1737 car-following events from the Next Generation Simulation (NGSIM) dataset for their high frequency in real-world driving. The reward function of MATD3 integrates safety, traffic efficiency, passenger comfort, and energy efficiency. An action mask scheme is incorporated to prevent collisions, thereby boosting learning efficiency. Monte Carlo simulation results show that our strategy outperforms IDM and model predictive control (MPC) in improving energy efficiency by an average of 7.73% and 3.38%, respectively. Furthermore, our framework offers extended benefits to HDVs, which achieve improved energy efficiency following the CAVs' control pattern. A case study further demonstrates that a "moderate"driving style results in lower energy consumption, effectively linking human behaviors to energy efficiency.
AB - The push toward sustainable transportation emphasizes vehicular energy efficiency in mixed traffic scenarios. A research hotspot is the cooperative control of connected and automated vehicles (CAVs), particularly in contexts involving the uncertainties of human-driven vehicles (HDVs). Cooperative control strategies are pivotal in improving driving safety, traffic efficiency, and reducing energy consumption. Our study introduces a cooperative control strategy for CAVs in mixed traffic based on the multiagent twin delayed deep deterministic policy gradient (MATD3) algorithm. We use the intelligent driver model (IDM) to calibrate and model human driving behaviors with 1737 car-following events from the Next Generation Simulation (NGSIM) dataset for their high frequency in real-world driving. The reward function of MATD3 integrates safety, traffic efficiency, passenger comfort, and energy efficiency. An action mask scheme is incorporated to prevent collisions, thereby boosting learning efficiency. Monte Carlo simulation results show that our strategy outperforms IDM and model predictive control (MPC) in improving energy efficiency by an average of 7.73% and 3.38%, respectively. Furthermore, our framework offers extended benefits to HDVs, which achieve improved energy efficiency following the CAVs' control pattern. A case study further demonstrates that a "moderate"driving style results in lower energy consumption, effectively linking human behaviors to energy efficiency.
KW - Car-following
KW - connected and automated vehicles (CAVs)
KW - eco-driving
KW - intelligent driving
KW - mixed traffic
KW - model predictive control (MPC)
KW - multiagent reinforcement learning (MARL)
UR - http://www.scopus.com/inward/record.url?scp=85190167594&partnerID=8YFLogxK
U2 - 10.1109/TTE.2024.3383091
DO - 10.1109/TTE.2024.3383091
M3 - Article
AN - SCOPUS:85190167594
SN - 2332-7782
VL - 10
SP - 8671
EP - 8684
JO - IEEE Transactions on Transportation Electrification
JF - IEEE Transactions on Transportation Electrification
IS - 4
ER -