Learning to Grasp through World Models with Backward Prediction

Shuze Wang*, Yetian Yuan, Yunpeng Mei, Jian Sun, Gang Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the era of the fourth industrial revolution, the paradigm of intelligent manufacturing has ascended as a pivotal driver for future advancements. The combination of intelligent manufacturing with reinforcement learning offers significant benefits and opportunities. However, applying reinforcement learning in practical settings faces challenges due to sampling efficiency. Recently, model-based reinforcement learning algorithms have demonstrated higher sample efficiency. In this research, we introduce an enhanced version of the Dreamer algorithm and devise a system dedicated to mastering the tasks of picking and placing through the limited data collected. The improved algorithm addresses the issue of low sample efficiency and proves to be exceptionally effective in pixel-based tasks associated with high-dimensional state spaces. To corroborate the efficacy of the proposed algorithm, we evaluate the algorithm on a simulation environment of a production line for robotic arm operations developed in Cop-peliasim. The merits of the augmented algorithm are further corroborated through its performance in complex DeepMind control tasks.

Original languageEnglish
Title of host publicationProceedings of the 43rd Chinese Control Conference, CCC 2024
EditorsJing Na, Jian Sun
PublisherIEEE Computer Society
Pages8810-8815
Number of pages6
ISBN (Electronic)9789887581581
DOIs
Publication statusPublished - 2024
Event43rd Chinese Control Conference, CCC 2024 - Kunming, China
Duration: 28 Jul 202431 Jul 2024

Publication series

NameChinese Control Conference, CCC
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference43rd Chinese Control Conference, CCC 2024
Country/TerritoryChina
CityKunming
Period28/07/2431/07/24

Keywords

  • Reinforcement Learning
  • Robot Arm
  • World Model

Fingerprint

Dive into the research topics of 'Learning to Grasp through World Models with Backward Prediction'. Together they form a unique fingerprint.

Cite this