Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning

Zheng Li; Junjie Zhou; Xueyuan Li; Xu Du; Lei Wang; Yun Wang

doi:10.1109/ICUS50048.2020.9274962

Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning

Zheng Li, Junjie Zhou, Xueyuan Li, Xu Du, Lei Wang, Yun Wang

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

6 引用（Scopus）

摘要

Skid Steering vehicles are being widely used due to their robust mechanical structure and high maneuverability. Moving object tracking for unmanned skid-steered vehicle (USSV) is a challenging task that requires delicate actions to ensure a smooth trajectory and accurate response between ego vehicle and the moving object. However, inevitable slipping and sliding of the tire that makes the vehicle difficult to control and accurate model of USSV are hard to describe. This paper proposes a real-time moving object tracking system with continuous actions for USSV base on a reinforcement learning algorithm named Twin Delay Deterministic Policy Gradient (TD3). The capacity of the replay buffer, which is critical in the training process, changes softly as the training episodes increases. We added two control group models with a fixed capacity of replay buffer and trained the RL agent from scratch in the gazebo environment. By observing the training and validation results, we can conclude that our RL model performs well for moving target tracking, and the model with soft updated replay buffer has high efficiency in the training process and high accuracy in the evaluation process.

源语言	英语
主期刊名	Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020
出版商	Institute of Electrical and Electronics Engineers Inc.
页	456-461
页数	6
ISBN（电子版）	9781728180250
DOI	https://doi.org/10.1109/ICUS50048.2020.9274962
出版状态	已出版 - 27 11月 2020
活动	3rd International Conference on Unmanned Systems, ICUS 2020 - Harbin, 中国期限: 27 11月 2020 → 28 11月 2020

出版系列

姓名	Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

会议

会议	3rd International Conference on Unmanned Systems, ICUS 2020
国家/地区	中国
市	Harbin
时期	27/11/20 → 28/11/20

访问文件

10.1109/ICUS50048.2020.9274962

其它文件与链接

链接到 Scopus 的出版物

引用此

Li, Z., Zhou, J., Li, X., Du, X., Wang, L., & Wang, Y. (2020). Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning. 在 Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020 (页码 456-461). 文章 9274962 (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICUS50048.2020.9274962

Li, Zheng ; Zhou, Junjie ; Li, Xueyuan 等. / Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning. Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 456-461 (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020).

@inproceedings{f82ebdc0c15049a9a5d527aca81c0604,

title = "Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning",

abstract = "Skid Steering vehicles are being widely used due to their robust mechanical structure and high maneuverability. Moving object tracking for unmanned skid-steered vehicle (USSV) is a challenging task that requires delicate actions to ensure a smooth trajectory and accurate response between ego vehicle and the moving object. However, inevitable slipping and sliding of the tire that makes the vehicle difficult to control and accurate model of USSV are hard to describe. This paper proposes a real-time moving object tracking system with continuous actions for USSV base on a reinforcement learning algorithm named Twin Delay Deterministic Policy Gradient (TD3). The capacity of the replay buffer, which is critical in the training process, changes softly as the training episodes increases. We added two control group models with a fixed capacity of replay buffer and trained the RL agent from scratch in the gazebo environment. By observing the training and validation results, we can conclude that our RL model performs well for moving target tracking, and the model with soft updated replay buffer has high efficiency in the training process and high accuracy in the evaluation process.",

keywords = "Continuous control, Reinforcement learning, USSV object tracking, Unmanned skid-steered vehicle",

author = "Zheng Li and Junjie Zhou and Xueyuan Li and Xu Du and Lei Wang and Yun Wang",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 3rd International Conference on Unmanned Systems, ICUS 2020 ; Conference date: 27-11-2020 Through 28-11-2020",

year = "2020",

month = nov,

day = "27",

doi = "10.1109/ICUS50048.2020.9274962",

language = "English",

series = "Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "456--461",

booktitle = "Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020",

address = "United States",

}

Li, Z, Zhou, J, Li, X, Du, X, Wang, L & Wang, Y 2020, Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning. 在 Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020., 9274962, Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020, Institute of Electrical and Electronics Engineers Inc., 页码 456-461, 3rd International Conference on Unmanned Systems, ICUS 2020, Harbin, 中国, 27/11/20. https://doi.org/10.1109/ICUS50048.2020.9274962

Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning. / Li, Zheng; Zhou, Junjie; Li, Xueyuan 等.
Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 456-461 9274962 (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning

AU - Li, Zheng

AU - Zhou, Junjie

AU - Li, Xueyuan

AU - Du, Xu

AU - Wang, Lei

AU - Wang, Yun

PY - 2020/11/27

Y1 - 2020/11/27

N2 - Skid Steering vehicles are being widely used due to their robust mechanical structure and high maneuverability. Moving object tracking for unmanned skid-steered vehicle (USSV) is a challenging task that requires delicate actions to ensure a smooth trajectory and accurate response between ego vehicle and the moving object. However, inevitable slipping and sliding of the tire that makes the vehicle difficult to control and accurate model of USSV are hard to describe. This paper proposes a real-time moving object tracking system with continuous actions for USSV base on a reinforcement learning algorithm named Twin Delay Deterministic Policy Gradient (TD3). The capacity of the replay buffer, which is critical in the training process, changes softly as the training episodes increases. We added two control group models with a fixed capacity of replay buffer and trained the RL agent from scratch in the gazebo environment. By observing the training and validation results, we can conclude that our RL model performs well for moving target tracking, and the model with soft updated replay buffer has high efficiency in the training process and high accuracy in the evaluation process.

AB - Skid Steering vehicles are being widely used due to their robust mechanical structure and high maneuverability. Moving object tracking for unmanned skid-steered vehicle (USSV) is a challenging task that requires delicate actions to ensure a smooth trajectory and accurate response between ego vehicle and the moving object. However, inevitable slipping and sliding of the tire that makes the vehicle difficult to control and accurate model of USSV are hard to describe. This paper proposes a real-time moving object tracking system with continuous actions for USSV base on a reinforcement learning algorithm named Twin Delay Deterministic Policy Gradient (TD3). The capacity of the replay buffer, which is critical in the training process, changes softly as the training episodes increases. We added two control group models with a fixed capacity of replay buffer and trained the RL agent from scratch in the gazebo environment. By observing the training and validation results, we can conclude that our RL model performs well for moving target tracking, and the model with soft updated replay buffer has high efficiency in the training process and high accuracy in the evaluation process.

KW - Continuous control

KW - Reinforcement learning

KW - USSV object tracking

KW - Unmanned skid-steered vehicle

UR - http://www.scopus.com/inward/record.url?scp=85098982009&partnerID=8YFLogxK

U2 - 10.1109/ICUS50048.2020.9274962

DO - 10.1109/ICUS50048.2020.9274962

M3 - Conference contribution

AN - SCOPUS:85098982009

T3 - Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

SP - 456

EP - 461

BT - Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 3rd International Conference on Unmanned Systems, ICUS 2020

Y2 - 27 November 2020 through 28 November 2020

ER -

Li Z, Zhou J, Li X, Du X, Wang L, Wang Y. Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning. 在 Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020. Institute of Electrical and Electronics Engineers Inc. 2020. 页码 456-461. 9274962. (Proceedings of 2020 3rd International Conference on Unmanned Systems, ICUS 2020). doi: 10.1109/ICUS50048.2020.9274962

Continuous control for moving object tracking of unmanned skid-steered vehicle based on reinforcement learning

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此