Full Attention Tracker: A Good Combination of Pixel-Level and Region-Level Cross-Correlation

Yuxuan Wang, Liping Yan*, Zihang Feng, Yuanqing Xia, Bo Xiao

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The tracker based on Siamese neural network is currently a technical method with high accuracy in the tracking field. With the introduction of transformer in the visual tracking field, the attention mechanism has gradually emerged in tracking tasks. However, due to the characteristics of attention operation, Transformer usually has slow convergence speed, and its pixel-level correlation discrimination in tracking is more likely to lead to overfitting, which is not conducive to long-term tracking. A brand new framework FAT was designed, which is the improvement of MixFormer. The operation for simultaneous feature extraction and target information integration in MixFormer is retained, and the Mixing block is introduced to suppress the background as much as possible before the information interaction. In addition, a new operation is designed: the result of region-level cross-correlation is used as a guidance to help the learning of pixel-level cross-correlation in attention, thereby accelerating the model convergence speed and enhancing the model generalization. Finally, a joint loss function is designed to further improve the accuracy of the model. Experiments show that the presented tracker achieves excellent performance on five benchmark datasets.

源语言英语
主期刊名2023 42nd Chinese Control Conference, CCC 2023
出版商IEEE Computer Society
7440-7446
页数7
ISBN(电子版)9789887581543
DOI
出版状态已出版 - 2023
活动42nd Chinese Control Conference, CCC 2023 - Tianjin, 中国
期限: 24 7月 202326 7月 2023

出版系列

姓名Chinese Control Conference, CCC
2023-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议42nd Chinese Control Conference, CCC 2023
国家/地区中国
Tianjin
时期24/07/2326/07/23

指纹

探究 'Full Attention Tracker: A Good Combination of Pixel-Level and Region-Level Cross-Correlation' 的科研主题。它们共同构成独一无二的指纹。

引用此