Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction

Mingchi Zhu; Haoping She; Weiyong Si; Chuanjun Li

doi:10.1109/ICAC61394.2024.10718779

Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction

Mingchi Zhu, Haoping She^*, Weiyong Si, Chuanjun Li

^*此作品的通讯作者

宇航学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Existing imitation learning methods for human directional corrections may lead to learning incorrect behaviors due to erroneous artificial teaching, resulting in a significant increase in the required number of iterations and even non-convergence situations, which can affect the system's performance. Additionally, the high computational complexity makes it unsuitable for embedded real-time application scenarios. To address these two issues, this study proposes a lightweight imitation learning algorithm that pre-corrects human-directed corrections. This method utilizes a deep learning network trained on a small dataset to correct human directional corrections and designs a lower-dimensional cost function for imitation learning. The proposed approach is applied to the example of a drone passing through doorways. Through the construction of a simulation platform and conducting simulation verification, the results show that the algorithm incorporating the correction error detection mechanism achieves an accuracy of over 98% in discerning human corrections, reduces training time by 27.87% per iteration, and decreases the average number of rounds by approximately 40%. The results indicate that the algorithm, which combines correction detection based on deep learning and a low-dimensional cost function, improves the accuracy of algorithm iterations, reduces computational complexity, and enhances computational speed.

源语言	英语
主期刊名	ICAC 2024 - 29th International Conference on Automation and Computing
出版商	Institute of Electrical and Electronics Engineers Inc.
ISBN（电子版）	9798350360882
DOI	https://doi.org/10.1109/ICAC61394.2024.10718779
出版状态	已出版 - 2024
活动	29th International Conference on Automation and Computing, ICAC 2024 - Sunderland, 英国期限: 28 8月 2024 → 30 8月 2024

出版系列

姓名	ICAC 2024 - 29th International Conference on Automation and Computing

会议

会议	29th International Conference on Automation and Computing, ICAC 2024
国家/地区	英国
市	Sunderland
时期	28/08/24 → 30/08/24

访问文件

10.1109/ICAC61394.2024.10718779

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhu, M., She, H., Si, W., & Li, C. (2024). Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction. 在 ICAC 2024 - 29th International Conference on Automation and Computing (ICAC 2024 - 29th International Conference on Automation and Computing). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICAC61394.2024.10718779

@inproceedings{4c9971b6e02d470ba06c96008ecb75e4,

title = "Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction",

abstract = "Existing imitation learning methods for human directional corrections may lead to learning incorrect behaviors due to erroneous artificial teaching, resulting in a significant increase in the required number of iterations and even non-convergence situations, which can affect the system's performance. Additionally, the high computational complexity makes it unsuitable for embedded real-time application scenarios. To address these two issues, this study proposes a lightweight imitation learning algorithm that pre-corrects human-directed corrections. This method utilizes a deep learning network trained on a small dataset to correct human directional corrections and designs a lower-dimensional cost function for imitation learning. The proposed approach is applied to the example of a drone passing through doorways. Through the construction of a simulation platform and conducting simulation verification, the results show that the algorithm incorporating the correction error detection mechanism achieves an accuracy of over 98% in discerning human corrections, reduces training time by 27.87% per iteration, and decreases the average number of rounds by approximately 40%. The results indicate that the algorithm, which combines correction detection based on deep learning and a low-dimensional cost function, improves the accuracy of algorithm iterations, reduces computational complexity, and enhances computational speed.",

keywords = "cost function design, error recovery for human correction, Learning from demonstrations (LfD), lightweight network, small-dataset neural network",

author = "Mingchi Zhu and Haoping She and Weiyong Si and Chuanjun Li",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 29th International Conference on Automation and Computing, ICAC 2024 ; Conference date: 28-08-2024 Through 30-08-2024",

year = "2024",

doi = "10.1109/ICAC61394.2024.10718779",

language = "English",

series = "ICAC 2024 - 29th International Conference on Automation and Computing",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "ICAC 2024 - 29th International Conference on Automation and Computing",

address = "United States",

}

Zhu, M, She, H, Si, W & Li, C 2024, Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction. 在 ICAC 2024 - 29th International Conference on Automation and Computing. ICAC 2024 - 29th International Conference on Automation and Computing, Institute of Electrical and Electronics Engineers Inc., 29th International Conference on Automation and Computing, ICAC 2024, Sunderland, 英国, 28/08/24. https://doi.org/10.1109/ICAC61394.2024.10718779

Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction. / Zhu, Mingchi; She, Haoping; Si, Weiyong 等.
ICAC 2024 - 29th International Conference on Automation and Computing. Institute of Electrical and Electronics Engineers Inc., 2024. (ICAC 2024 - 29th International Conference on Automation and Computing).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction

AU - Zhu, Mingchi

AU - She, Haoping

AU - Si, Weiyong

AU - Li, Chuanjun

PY - 2024

Y1 - 2024

N2 - Existing imitation learning methods for human directional corrections may lead to learning incorrect behaviors due to erroneous artificial teaching, resulting in a significant increase in the required number of iterations and even non-convergence situations, which can affect the system's performance. Additionally, the high computational complexity makes it unsuitable for embedded real-time application scenarios. To address these two issues, this study proposes a lightweight imitation learning algorithm that pre-corrects human-directed corrections. This method utilizes a deep learning network trained on a small dataset to correct human directional corrections and designs a lower-dimensional cost function for imitation learning. The proposed approach is applied to the example of a drone passing through doorways. Through the construction of a simulation platform and conducting simulation verification, the results show that the algorithm incorporating the correction error detection mechanism achieves an accuracy of over 98% in discerning human corrections, reduces training time by 27.87% per iteration, and decreases the average number of rounds by approximately 40%. The results indicate that the algorithm, which combines correction detection based on deep learning and a low-dimensional cost function, improves the accuracy of algorithm iterations, reduces computational complexity, and enhances computational speed.

AB - Existing imitation learning methods for human directional corrections may lead to learning incorrect behaviors due to erroneous artificial teaching, resulting in a significant increase in the required number of iterations and even non-convergence situations, which can affect the system's performance. Additionally, the high computational complexity makes it unsuitable for embedded real-time application scenarios. To address these two issues, this study proposes a lightweight imitation learning algorithm that pre-corrects human-directed corrections. This method utilizes a deep learning network trained on a small dataset to correct human directional corrections and designs a lower-dimensional cost function for imitation learning. The proposed approach is applied to the example of a drone passing through doorways. Through the construction of a simulation platform and conducting simulation verification, the results show that the algorithm incorporating the correction error detection mechanism achieves an accuracy of over 98% in discerning human corrections, reduces training time by 27.87% per iteration, and decreases the average number of rounds by approximately 40%. The results indicate that the algorithm, which combines correction detection based on deep learning and a low-dimensional cost function, improves the accuracy of algorithm iterations, reduces computational complexity, and enhances computational speed.

KW - cost function design

KW - error recovery for human correction

KW - Learning from demonstrations (LfD)

KW - lightweight network

KW - small-dataset neural network

UR - http://www.scopus.com/inward/record.url?scp=85208623627&partnerID=8YFLogxK

U2 - 10.1109/ICAC61394.2024.10718779

DO - 10.1109/ICAC61394.2024.10718779

M3 - Conference contribution

AN - SCOPUS:85208623627

T3 - ICAC 2024 - 29th International Conference on Automation and Computing

BT - ICAC 2024 - 29th International Conference on Automation and Computing

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 29th International Conference on Automation and Computing, ICAC 2024

Y2 - 28 August 2024 through 30 August 2024

ER -

Lightweight Imitation Learning Algorithm with Error Recovery for Human Direction Correction

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此