A Path Planning Method Based on Deep Reinforcement Learning with Improved Prioritized Experience Replay for Human-Robot Collaboration

Deyu Sun, Jingqian Wen, Jingfei Wang, Xiaonan Yang*, Yaoguang Hu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Owing to its ability to integrate human flexibility with robotic automation, human-robot collaboration possesses tremendous potential in intelligent manufacturing. A quintessential characteristic of this collaboration is the necessity for robotic arms to cooperate with humans in a dynamically changing environment, wherein humans could be considered as dynamic obstacles. One of the significant challenges in human-robot collaboration is the development of obstacle avoidance strategies for robotic path planning within dynamically changing environments. The inability of traditional two-dimensional path planning methods to handle high-dimensional spaces, therefore, many researchers have turned their attention to deep reinforcement learning, and many deep reinforcement learning methods have been applied to robotic arm path planning. However, most deep reinforcement learning models for robotic arm path planning require a significant amount of training time to achieve convergence. In this study, we introduce an algorithm that synergizes Soft Actor-Critic (SAC) with an improved version of Prioritized Experience Replay (PER)—SAC-iPER. We prioritizes experiences based on task-rewards, employing metrics such as time consumption and collision occurrences, in addition to task completion, to rank experiences. This reward-based ordering significantly boosts the learning process in both speed and quality. The results of this study significantly enhanced the training efficiency of deep reinforcement learning models for robotic arm path planning within human-robot collaboration, paving the way for the development of more efficient human-robot collaborative systems.

源语言英语
主期刊名Human-Computer Interaction - Thematic Area, HCI 2024, Held as Part of the 26th HCI International Conference, HCII 2024, Proceedings
编辑Masaaki Kurosu, Ayako Hashizume
出版商Springer Science and Business Media Deutschland GmbH
196-206
页数11
ISBN(印刷版)9783031604119
DOI
出版状态已出版 - 2024
活动Human Computer Interaction thematic area of the 26th International Conference on Human-Computer Interaction, HCII 2024 - Washington, 美国
期限: 29 6月 20244 7月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14685 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议Human Computer Interaction thematic area of the 26th International Conference on Human-Computer Interaction, HCII 2024
国家/地区美国
Washington
时期29/06/244/07/24

指纹

探究 'A Path Planning Method Based on Deep Reinforcement Learning with Improved Prioritized Experience Replay for Human-Robot Collaboration' 的科研主题。它们共同构成独一无二的指纹。

引用此