TY - GEN
T1 - Generalizing Autonomous Navigation via Human-Guided Diffusion Policies with Adversarial Reinforcement Learning
AU - Hu, Dong
AU - Zhou, Yangyang
AU - Hu, Fengqing
AU - Huang, Chao
AU - Wu, Jingda
AU - Savkin, Andrey V.
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - To address the challenges of autonomous vehicle navigation in dynamic and complex environments-particularly in LiDAR-free and low-cost settings-this study proposes an online adversarial reinforcement learning (RL) framework that integrates real-time human guidance with diffusion-based policies. Unlike prior diffusion RL works, our method enables stable online training of diffusion policies by leveraging human guidance to stabilize data distribution and improve early-stage exploration. Furthermore, adversarial training is introduced to enhance robustness and generalization in dynamic multi-task scenarios. The framework uses only onboard cameras without reliance on prior maps or expensive sensors, allowing for costeffective deployment. Experimental results demonstrate that our approach substantially enhances adaptability and multi-task generalization, outperforming state-of-the-art baselines in training efficiency, success rate, and safety across various dynamic and complex environments.
AB - To address the challenges of autonomous vehicle navigation in dynamic and complex environments-particularly in LiDAR-free and low-cost settings-this study proposes an online adversarial reinforcement learning (RL) framework that integrates real-time human guidance with diffusion-based policies. Unlike prior diffusion RL works, our method enables stable online training of diffusion policies by leveraging human guidance to stabilize data distribution and improve early-stage exploration. Furthermore, adversarial training is introduced to enhance robustness and generalization in dynamic multi-task scenarios. The framework uses only onboard cameras without reliance on prior maps or expensive sensors, allowing for costeffective deployment. Experimental results demonstrate that our approach substantially enhances adaptability and multi-task generalization, outperforming state-of-the-art baselines in training efficiency, success rate, and safety across various dynamic and complex environments.
UR - https://www.scopus.com/pages/publications/105037014996
U2 - 10.1109/ITSC60802.2025.11423599
DO - 10.1109/ITSC60802.2025.11423599
M3 - Conference contribution
AN - SCOPUS:105037014996
T3 - IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
SP - 1301
EP - 1306
BT - IEEE Intelligent Transportation Systems Conference, ITSC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 28th International Conference on Intelligent Transportation Systems, ITSC 2025
Y2 - 18 November 2025 through 21 November 2025
ER -