Optimization Method of Learning from Demonstration based on Incremental GMR-GP

Zhiqiang Xia, Di Hua Zhai*, Haocun Wu, Yuanqing Xia

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The trajectory generated by GMR-GP cannot go through the first via-point precisely. Therefore, this paper designs an optimization method of Learning from Demonstration based on incremental GMR-GP. First, an incremental GMR-GP algorithm is designed. The advantage of the incremental GMR-GP is that the starting point of the mean trajectory can gradually approach to the real observation point while maintaining the true intention of the teaching action as much as possible. Then, an incremental GMR-GP based on importance weighting is proposed, which makes an important distinction among the new trajectories generated by the incremental GMR-GP. The generated trajectory further improves the reservation of the prior trajectory on the true teaching intention, and the uncertainty is reduced. Moreover, the availability of the proposed method is validated and analyzed by performing a series of numerical simulations and Baxter robot experiments. The results indicate that the proposed method can provide reliable solutions, which can go through the first via-point more precisely while retaining the true demonstrating intent as much as possible.

源语言英语
主期刊名Proceedings of the 40th Chinese Control Conference, CCC 2021
编辑Chen Peng, Jian Sun
出版商IEEE Computer Society
4050-4055
页数6
ISBN(电子版)9789881563804
DOI
出版状态已出版 - 26 7月 2021
活动40th Chinese Control Conference, CCC 2021 - Shanghai, 中国
期限: 26 7月 202128 7月 2021

出版系列

姓名Chinese Control Conference, CCC
2021-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议40th Chinese Control Conference, CCC 2021
国家/地区中国
Shanghai
时期26/07/2128/07/21

指纹

探究 'Optimization Method of Learning from Demonstration based on Incremental GMR-GP' 的科研主题。它们共同构成独一无二的指纹。

引用此