Q-learning based method of adaptive path planning for mobile robot

Li Yibin*, Li Caihong, Zhang Zijian

*此作品的通讯作者

科研成果: 会议稿件论文同行评审

9 引用 (Scopus)

摘要

Reinforcement learning (RL) is a learning technique based on trial and error. Q-learning is a method of RL algorithms. It has been applied widely in the adaptive path planning for the autonomous mobile robot. In order to decrease the learning space and increase the learning convergent speed, this paper adopts Q-layered learning method to divide the task of searching optimal path into three basic behaviors (or subtasks), namely static obstacle-avoidance, dynamic obstacle-avoidance and goal approaching. Especially in the learning for the static obstacle-avoidance behavior, a novel priority Q search method (PQA) is used to avoid the blindly search of the random search algorithm (RA) which is always used to select actions in Q-learning. PQA uses the sum of weighted vectors pointing away from obstacles to predict the magnitude of the reinforcement reward receiving from the possible state-action after executing the action. Robot controller will select an action based on the result at the next executing time. At last PQA and RA are both simulated in two different environments. The learning results show that learn steps are fewer by PQA than by RA under same environment to achieve the task. And in the total learning periods PQA has the higher task complete percent. PQA is an effective way to solve the problem of the path planning under dynamic and unknown environment.

源语言英语
983-987
页数5
DOI
出版状态已出版 - 2006
已对外发布
活动2006 IEEE International Conference on Information Acquisition, ICIA 2006 - Weihai, Shandong, 中国
期限: 20 8月 200623 8月 2006

会议

会议2006 IEEE International Conference on Information Acquisition, ICIA 2006
国家/地区中国
Weihai, Shandong
时期20/08/0623/08/06

指纹

探究 'Q-learning based method of adaptive path planning for mobile robot' 的科研主题。它们共同构成独一无二的指纹。

引用此