跳到主要导航 跳到搜索 跳到主要内容

Generalizing Autonomous Navigation via Human-Guided Diffusion Policies with Adversarial Reinforcement Learning

  • Dong Hu
  • , Yangyang Zhou
  • , Fengqing Hu
  • , Chao Huang*
  • , Jingda Wu
  • , Andrey V. Savkin
  • *此作品的通讯作者
  • Hong Kong Polytechnic University
  • Beijing Institute of Technology
  • University of New South Wales

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

To address the challenges of autonomous vehicle navigation in dynamic and complex environments-particularly in LiDAR-free and low-cost settings-this study proposes an online adversarial reinforcement learning (RL) framework that integrates real-time human guidance with diffusion-based policies. Unlike prior diffusion RL works, our method enables stable online training of diffusion policies by leveraging human guidance to stabilize data distribution and improve early-stage exploration. Furthermore, adversarial training is introduced to enhance robustness and generalization in dynamic multi-task scenarios. The framework uses only onboard cameras without reliance on prior maps or expensive sensors, allowing for costeffective deployment. Experimental results demonstrate that our approach substantially enhances adaptability and multi-task generalization, outperforming state-of-the-art baselines in training efficiency, success rate, and safety across various dynamic and complex environments.

源语言英语
主期刊名IEEE Intelligent Transportation Systems Conference, ITSC 2025
出版商Institute of Electrical and Electronics Engineers Inc.
1301-1306
页数6
ISBN(电子版)9798331524180
DOI
出版状态已出版 - 2025
已对外发布
活动28th International Conference on Intelligent Transportation Systems, ITSC 2025 - Gold Coast, 澳大利亚
期限: 18 11月 202521 11月 2025

出版系列

姓名IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
ISSN(印刷版)2153-0009
ISSN(电子版)2153-0017

会议

会议28th International Conference on Intelligent Transportation Systems, ITSC 2025
国家/地区澳大利亚
Gold Coast
时期18/11/2521/11/25

指纹

探究 'Generalizing Autonomous Navigation via Human-Guided Diffusion Policies with Adversarial Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此