Trajectory Planning Algorithm of Manipulator in Small Space Based on Reinforcement Learning

Haoyu Wang, Huaishi Zhu, Fangfei Cao*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The development of reinforcement learning has driven the progress of robot control technology. In recent years, reinforcement learning has become one of the highly concerned fields in the academic community, especially the control of robotic arms in the industrial field. In order to achieve intelligent and efficient production, the emphasis is on the research of obstacle avoidance motion planning of the manipulator. However, traditional trajectory planning algorithms have problems such as slow convergence speed, low intelligence, and difficulty in achieving optimization. In this regard, this research takes the six degrees of freedom manipulator PUMA550 as the research object, and focuses on the obstacle avoidance motion planning problem of the manipulator, studies the manipulator modeling based on the improved D-H parameter method, Rapidly-exploring Random Trees (RRT) algorithm, the Q-learning algorithm and the double Q network learning alzorithm.

Original languageEnglish
Title of host publicationProceedings - 2023 China Automation Congress, CAC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5780-5785
Number of pages6
ISBN (Electronic)9798350303759
DOIs
Publication statusPublished - 2023
Event2023 China Automation Congress, CAC 2023 - Chongqing, China
Duration: 17 Nov 202319 Nov 2023

Publication series

NameProceedings - 2023 China Automation Congress, CAC 2023

Conference

Conference2023 China Automation Congress, CAC 2023
Country/TerritoryChina
CityChongqing
Period17/11/2319/11/23

Keywords

  • Deep Reinforcement Learning
  • Manipulator
  • Reinforcement Learning
  • Trajector -y Planning

Fingerprint

Dive into the research topics of 'Trajectory Planning Algorithm of Manipulator in Small Space Based on Reinforcement Learning'. Together they form a unique fingerprint.

Cite this