Parallel CNN-Transformer Dual-branch Hybrid Tracker

Chenxi Li, Yongqiang Bai

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Currently, object tracking methods are increasingly adopting Transformer-based models to replace the earlier convolutional neural networks (CNNs). While these methods have achieved performance improvements, they tend to overlook the importance of local feature details in visual tasks due to the global modeling nature of Transformers. In this paper, we propose a parallel CNN-Transformer dual-branch hybrid tracking model (PCTTrack). By designing a feature fusion module with various attention mechanisms and an improved prediction head, the model effectively leverages both local and global information advantages. Experiments show that our method achieves competitive results on multiple object tracking datasets. For instance, it achieves an AO of 75.5 on the GOT-10K dataset. Compared to the single Transformer branch, the hybrid model improves the AUC on LaSOT by 3.8% and the AO on GOT-10K by 2.6%. Additionally, through visualizing the outputs of different model structures, we validate the effectiveness of the dual-branch fusion model.

Original languageEnglish
Title of host publicationProceedings - 2024 China Automation Congress, CAC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages3945-3950
Number of pages6
ISBN (Electronic)9798350368604
DOIs
Publication statusPublished - 2024
Event2024 China Automation Congress, CAC 2024 - Qingdao, China
Duration: 1 Nov 20243 Nov 2024

Publication series

NameProceedings - 2024 China Automation Congress, CAC 2024

Conference

Conference2024 China Automation Congress, CAC 2024
Country/TerritoryChina
CityQingdao
Period1/11/243/11/24

Keywords

  • CNN
  • Feature fusion
  • Object tracking
  • Prediction head
  • Vision Transformer

Fingerprint

Dive into the research topics of 'Parallel CNN-Transformer Dual-branch Hybrid Tracker'. Together they form a unique fingerprint.

Cite this

Li, C., & Bai, Y. (2024). Parallel CNN-Transformer Dual-branch Hybrid Tracker. In Proceedings - 2024 China Automation Congress, CAC 2024 (pp. 3945-3950). (Proceedings - 2024 China Automation Congress, CAC 2024). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CAC63892.2024.10865120