复杂动态环境下基于深度强化学习的AGV避障方法

Ze Cai, Yaoguang Hu*, Jingqian Wen, Lixiang Zhang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

To improve the collision avoidance capability of Automated Guided Vehicles (AGV) in the complex dynamic environment of smart factories,enable them to carry out material handling tasks more safely and efficiently following the global path, a local collision avoidance method based on deep reinforcement learning was proposed. The problem of collision avoidance of AGV was formulated as Partial Observational Markov Decision Process (POMDP) in which observation space, action space and reward function were expatiated. Tracking of the global path was a-chieved by setting different reward values. Then a Deep Deterministic Policy Gradient (DDPG) method was further implemented to solve collision avoidance policy. The trained policy was validated in various simulated scenarios, and the effectiveness was proved. The experimental results showed the proposed approach could respond to the complex dynamic environment and reduce the time and distance of collision avoidance.

投稿的翻译标题Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment
源语言繁体中文
页(从-至)236-245
页数10
期刊Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS
29
1
DOI
出版状态已出版 - 31 1月 2023

关键词

  • deep reinforcement learning
  • dynamic collision avoidance
  • smart factory
  • tracking of global path

指纹

探究 '复杂动态环境下基于深度强化学习的AGV避障方法' 的科研主题。它们共同构成独一无二的指纹。

引用此