TY - GEN
T1 - Autonomous Vehicle Dynamic Following Based on Improved Reinforcement Learning
AU - Song, Chunlei
AU - Yang, Xinyan
AU - Xu, Jianhua
AU - Zhang, Xiongfei
AU - Zhang, Chengyu
N1 - Publisher Copyright:
© 2025 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2025
Y1 - 2025
N2 - Target following technology, as one of the core technologies for autonomous vehicles, enables vehicles to automatically track dynamic targets. This paper proposes a path planning algorithm that combines proximal policy optimization(PPO) with Model predictive control(MPC) for dynamic target following tasks of autonomous vehicles in unknown environments. The algorithm utilizes neural networks to reduce computational load and employs MPC to compensate for the shortcomings of PPO in the prediction phase, thereby enhancing the following and obstacle avoidance capabilities of autonomous vehicles in dynamic environments. Simulation experiments show that the proposed PPO-MPC improves training speed by 35%, achieves trajectory following effects in low-speed dynamic target following tasks, and reduces the time required for static and dynamic obstacle avoidance tasks by approximately 8.5% compared to the original PPO, with smoother trajectories. It is capable of performing dynamic target following tasks in unknown environments with multiple obstacles.
AB - Target following technology, as one of the core technologies for autonomous vehicles, enables vehicles to automatically track dynamic targets. This paper proposes a path planning algorithm that combines proximal policy optimization(PPO) with Model predictive control(MPC) for dynamic target following tasks of autonomous vehicles in unknown environments. The algorithm utilizes neural networks to reduce computational load and employs MPC to compensate for the shortcomings of PPO in the prediction phase, thereby enhancing the following and obstacle avoidance capabilities of autonomous vehicles in dynamic environments. Simulation experiments show that the proposed PPO-MPC improves training speed by 35%, achieves trajectory following effects in low-speed dynamic target following tasks, and reduces the time required for static and dynamic obstacle avoidance tasks by approximately 8.5% compared to the original PPO, with smoother trajectories. It is capable of performing dynamic target following tasks in unknown environments with multiple obstacles.
KW - Autonomous vehicles
KW - Dynamic target following
KW - Model predictive control
KW - Proximal Policy Optimization
UR - https://www.scopus.com/pages/publications/105020316214
U2 - 10.23919/CCC64809.2025.11179268
DO - 10.23919/CCC64809.2025.11179268
M3 - Conference contribution
AN - SCOPUS:105020316214
T3 - Chinese Control Conference, CCC
SP - 4323
EP - 4328
BT - Proceedings of the 44th Chinese Control Conference, CCC 2025
A2 - Sun, Jian
A2 - Yin, Hongpeng
PB - IEEE Computer Society
T2 - 44th Chinese Control Conference, CCC 2025
Y2 - 28 July 2025 through 30 July 2025
ER -