Deep Siamese Cross-Residual Learning for Robust Visual Tracking

Fan Wu, Tingfa Xu*, Jie Guo, Bo Huang, Chang Xu, Jihui Wang, Xiangmin Li

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

The sixth-generation (6G) wireless technology contributes to the establishment of the Internet of Things (IoT). Recently, the IoT has become popular because of its smart architectures and various applications. Among these applications, intelligent urban surveillance systems for smart cities are becoming more and more important. Therefore, designing a robust visual tracking method has become an urgent task. Deep Siamese convolutional neural networks have been applied to visual tracking recently because of their advantageous abilities to learn a matching function between the template and the target candidate. Unlike traditional Siamese networks, which separately treat the two branches, we propose deep Siamese cross-residual learning to entangle the two branches from the beginning to the end of the Siamese network. This strategy can make the two branches exchange instance-specific information at different nodes of the network and learn a more compact representation of the target. In addition, we propose a combined loss function, which consists of two complementary tasks. One task is to learn a matching function directly and the other one is to learn a classification function. Moreover, our model does not need to load any pretrained weights and is trained with limited sequences from scratch. Plenty of experiments show that our tracker performs favorably against many state-of-the-art tracking methods.

Original languageEnglish
Pages (from-to)15216-15227
Number of pages12
JournalIEEE Internet of Things Journal
Volume8
Issue number20
DOIs
Publication statusPublished - 15 Oct 2021

Keywords

  • Convolutional neural network (CNN)
  • Internet of Things (IoT)
  • Siamese cross-residual learning
  • deep learning
  • visual tracking

Fingerprint

Dive into the research topics of 'Deep Siamese Cross-Residual Learning for Robust Visual Tracking'. Together they form a unique fingerprint.

Cite this