Abstract
To improve the collision avoidance capability of Automated Guided Vehicles (AGV) in the complex dynamic environment of smart factories,enable them to carry out material handling tasks more safely and efficiently following the global path, a local collision avoidance method based on deep reinforcement learning was proposed. The problem of collision avoidance of AGV was formulated as Partial Observational Markov Decision Process (POMDP) in which observation space, action space and reward function were expatiated. Tracking of the global path was a-chieved by setting different reward values. Then a Deep Deterministic Policy Gradient (DDPG) method was further implemented to solve collision avoidance policy. The trained policy was validated in various simulated scenarios, and the effectiveness was proved. The experimental results showed the proposed approach could respond to the complex dynamic environment and reduce the time and distance of collision avoidance.
Translated title of the contribution | Collision avoidance for AGV based on deep reinforcement learning in complex dynamic environment |
---|---|
Original language | Chinese (Traditional) |
Pages (from-to) | 236-245 |
Number of pages | 10 |
Journal | Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS |
Volume | 29 |
Issue number | 1 |
DOIs | |
Publication status | Published - 31 Jan 2023 |