Temporal-Aware Visual Object Tracking with Pyramidal Transformer and Adaptive Decoupling

Yiding Liang, Bo Ma*, Hao Xu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

We propose a one-stream visual object tracking algorithm PVTrack. First, we propose a one-stream pyramidal backbone based on the attention mechanism, which computes the template and search region in parallel to improve the computational efficiency of the tracker, and in which the attention mechanism establishes global contextual information to optimize the tracking performance. Secondly, we propose an adaptive decoupled prediction head, which performs targeted computation on different layers of features output from the backbone: for the low-level semantic features that are rich in target shape information, feature fusion is used to improve the regression accuracy of the model; for the high-level semantic information that is good for classification, classification regression decoupling is used to improve the target localization accuracy by utilizing the high-level semantic features alone. Finally, we introduce the discriminative template updating method and design the template updating threshold function, so as to improve the algorithm's ability of modeling temporal information. In this paper, tests and ablation experiments are conducted on multiple datasets to verify that the proposed one-stream visual object tracking algorithm based on discriminative template updating can effectively improve the computational efficiency and robustness of tracking.

源语言英语
主期刊名2024 5th International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2024
出版商Institute of Electrical and Electronics Engineers Inc.
270-277
页数8
ISBN(电子版)9798350366174
DOI
出版状态已出版 - 2024
活动5th International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2024 - Shenzhen, 中国
期限: 14 6月 202416 6月 2024

出版系列

姓名2024 5th International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2024

会议

会议5th International Conference on Artificial Intelligence and Electromechanical Automation, AIEA 2024
国家/地区中国
Shenzhen
时期14/06/2416/06/24

指纹

探究 'Temporal-Aware Visual Object Tracking with Pyramidal Transformer and Adaptive Decoupling' 的科研主题。它们共同构成独一无二的指纹。

引用此