Observation-Time-Action Deep Stacking Strategy: Solving Partial Observability Problems with Visual Input

Keyang Jiang, Qiang Wang*, Yahao Xu, Hongbin Deng

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Reinforcement learning tasks that involve visual input continue to pose a challenge when it comes to partial observability problems. Although prior research has introduced methods such as LSTM, GTrXL, and DNC, each of these approaches possesses its own limitations. To address the problem of partial observability in a more universal context, this paper proposed the Observation-Time-Action deep stacking algorithm. First, observations, actions, and time data were combined into a tuple and then stacked into a longer sequence. Then, convolution and fully connected layers were utilized to extract relevant features from the sequence, which were then fed into the algorithm for processing. We designed a number of experiments with partial observability, corresponding to different typical scenarios in reinforcement learning. The experiment results demonstrated that the proposed method obtained a higher success rate. Moreover, we also investigated the effect of stacking frame length and different reinforcement learning elements on the algorithm. Finally, we conducted a HITL (Hardware-in-the-Loop) experiment to further verify the effectiveness of our algorithm.

源语言英语
主期刊名2024 International Joint Conference on Neural Networks, IJCNN 2024 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350359312
DOI
出版状态已出版 - 2024
活动2024 International Joint Conference on Neural Networks, IJCNN 2024 - Yokohama, 日本
期限: 30 6月 20245 7月 2024

出版系列

姓名Proceedings of the International Joint Conference on Neural Networks

会议

会议2024 International Joint Conference on Neural Networks, IJCNN 2024
国家/地区日本
Yokohama
时期30/06/245/07/24

指纹

探究 'Observation-Time-Action Deep Stacking Strategy: Solving Partial Observability Problems with Visual Input' 的科研主题。它们共同构成独一无二的指纹。

引用此

Jiang, K., Wang, Q., Xu, Y., & Deng, H. (2024). Observation-Time-Action Deep Stacking Strategy: Solving Partial Observability Problems with Visual Input. 在 2024 International Joint Conference on Neural Networks, IJCNN 2024 - Proceedings (Proceedings of the International Joint Conference on Neural Networks). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IJCNN60899.2024.10650736