HILPS: Human-in-Loop Policy Search for Mobile Robot Navigation

Mingxing Wen, Yufeng Yue, Zhenyu Wu, Ehsan Mihankhan, Danwei Wang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Reinforcement learning has obtained increasing attention in mobile robot mapless navigation in recent years. However, there are still some obvious challenges including the sample efficiency, safety due to dilemma of exploration and exploitation. These problems are addressed in this paper by proposing the Human-in-Loop Policy Search (HILPS) framework, where learning from demonstration, learning from human intervention and Near Optimal Policy strategies are integrated together. Firstly, the former two make sure that expert experience grant mobile robot a more informative and correct decision for accomplishing the task and also maintaining the safety of the mobile robot due to the priority of human control. Then the Near Optimal Policy (NOP) provides a way to selectively store the similar experience with respect to the preexisting human demonstration, in which case the sample efficiency can be improved by eliminating exclusively exploratory behaviors. To verify the performance of the algorithm, the mobile robot navigation experiments are extensively conducted in simulation and real world. Results show that HILPS can improve sample efficiency and safety in comparison to state-of-art reinforcement learning.

源语言英语
主期刊名16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020
出版商Institute of Electrical and Electronics Engineers Inc.
387-392
页数6
ISBN(电子版)9781728177090
DOI
出版状态已出版 - 13 12月 2020
已对外发布
活动16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020 - Virtual, Shenzhen, 中国
期限: 13 12月 202015 12月 2020

出版系列

姓名16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020

会议

会议16th IEEE International Conference on Control, Automation, Robotics and Vision, ICARCV 2020
国家/地区中国
Virtual, Shenzhen
时期13/12/2015/12/20

指纹

探究 'HILPS: Human-in-Loop Policy Search for Mobile Robot Navigation' 的科研主题。它们共同构成独一无二的指纹。

引用此