Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction

Mingchi Zhu, Haoping She*, Weiyong Si, Chuanjun Li

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Existing imitation learning methods for human directional corrections may lead to learning incorrect behaviors due to erroneous artificial teaching, resulting in a significant increase in the required number of iterations and even non-convergence situations, which can affect the system's performance. Additionally, the high computational complexity makes it unsuitable for embedded real-time application scenarios. To address these two issues, this study proposes a lightweight imitation learning algorithm that pre-corrects human-directed corrections. This method utilizes a deep learning network trained on a small dataset to correct human directional corrections and designs a lower-dimensional cost function for imitation learning. The proposed approach is applied to the example of a drone passing through doorways. Through the construction of a simulation platform and conducting simulation verification, the results show that the algorithm incorporating the correction error detection mechanism achieves an accuracy of over 98% in discerning human corrections, reduces training time by 27.87% per iteration, and decreases the average number of rounds by approximately 40%. The results indicate that the algorithm, which combines correction detection based on deep learning and a low-dimensional cost function, improves the accuracy of algorithm iterations, reduces computational complexity, and enhances computational speed.

源语言英语
主期刊名ICAC 2024 - 29th International Conference on Automation and Computing
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350360882
DOI
出版状态已出版 - 2024
活动29th International Conference on Automation and Computing, ICAC 2024 - Sunderland, 英国
期限: 28 8月 202430 8月 2024

出版系列

姓名ICAC 2024 - 29th International Conference on Automation and Computing

会议

会议29th International Conference on Automation and Computing, ICAC 2024
国家/地区英国
Sunderland
时期28/08/2430/08/24

指纹

探究 'Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction' 的科研主题。它们共同构成独一无二的指纹。

引用此