Toward Multi-Task Generalization in Autonomous Navigation: A Human-in-the-Loop Adversarial Reinforcement Learning With Diffusion Policy

  • Dong Hu
  • , Chao Huang*
  • , Jingda Wu
  • , Xin Yuan
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Due to the complexity and variability of real-world environments, data-driven autonomous navigation strategies for autonomous ground vehicles have significant potential to improve performance and adaptability in diverse scenarios. Reinforcement learning (RL) has emerged as a promising approach for autonomous navigation. However, existing RL methods often struggle with low sample efficiency, limited adaptability, and poor generalization in dynamic multi-task scenarios. To address these issues, we propose a novel framework: human-in-the-loop adversarial RL with diffusion policy, designed for scalable and robust policy learning. This framework leverages a diffusion model as policy network, effectively exploring and learning high-dimensional, multi-modal behavior distributions. It also integrates human feedback to improve data efficiency and stabilize policy training. On top of this, adversarial training is employed to improve robustness and adaptability to change in tasks and distributions. The proposed method is trained in simulation, and then the well-trained policy is transferred to the real-world. Experimental results demonstrate that this approach significantly outperforms existing methods in terms of efficiency, stability, generalization, and multi-task adaptability, offering a promising solution for the next generation of autonomous navigation systems.

Original languageEnglish
Pages (from-to)19493-19507
Number of pages15
JournalIEEE Transactions on Intelligent Transportation Systems
Volume26
Issue number11
DOIs
Publication statusPublished - 2025
Externally publishedYes

Keywords

  • adversarial training
  • Autonomous navigation
  • diffusion policy
  • human-in-the-loop reinforcement learning
  • multi-task generalization

Fingerprint

Dive into the research topics of 'Toward Multi-Task Generalization in Autonomous Navigation: A Human-in-the-Loop Adversarial Reinforcement Learning With Diffusion Policy'. Together they form a unique fingerprint.

Cite this