Optimization Method of Learning from Demonstration based on Incremental GMR-GP

Zhiqiang Xia, Di Hua Zhai*, Haocun Wu, Yuanqing Xia

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The trajectory generated by GMR-GP cannot go through the first via-point precisely. Therefore, this paper designs an optimization method of Learning from Demonstration based on incremental GMR-GP. First, an incremental GMR-GP algorithm is designed. The advantage of the incremental GMR-GP is that the starting point of the mean trajectory can gradually approach to the real observation point while maintaining the true intention of the teaching action as much as possible. Then, an incremental GMR-GP based on importance weighting is proposed, which makes an important distinction among the new trajectories generated by the incremental GMR-GP. The generated trajectory further improves the reservation of the prior trajectory on the true teaching intention, and the uncertainty is reduced. Moreover, the availability of the proposed method is validated and analyzed by performing a series of numerical simulations and Baxter robot experiments. The results indicate that the proposed method can provide reliable solutions, which can go through the first via-point more precisely while retaining the true demonstrating intent as much as possible.

Original languageEnglish
Title of host publicationProceedings of the 40th Chinese Control Conference, CCC 2021
EditorsChen Peng, Jian Sun
PublisherIEEE Computer Society
Pages4050-4055
Number of pages6
ISBN (Electronic)9789881563804
DOIs
Publication statusPublished - 26 Jul 2021
Event40th Chinese Control Conference, CCC 2021 - Shanghai, China
Duration: 26 Jul 202128 Jul 2021

Publication series

NameChinese Control Conference, CCC
Volume2021-July
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference40th Chinese Control Conference, CCC 2021
Country/TerritoryChina
CityShanghai
Period26/07/2128/07/21

Keywords

  • Imitation learning
  • Importance weighting
  • Incremental GMR-GP

Fingerprint

Dive into the research topics of 'Optimization Method of Learning from Demonstration based on Incremental GMR-GP'. Together they form a unique fingerprint.

Cite this