Deep Ensemble Tracking

Jie Guo; Tingfa Xu

doi:10.1109/LSP.2017.2749458

Deep Ensemble Tracking

Jie Guo, Tingfa Xu^*

^*Corresponding author for this work

School of Optics and Photonics

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

11 Citations (Scopus)

Abstract

In this letter, we cast visual tracking as a template matching problem in a Siamese deep convolutional neural network architecture. In contrast to traditional or other deep feature-based tracking methods, the proposed model exploits multilevel convolutional features from a partial view. The model matches candidate patch and template patch from the feature dimension of convolutional features, leading to hundreds of thousands of base matchers. The base matchers from low-level convolutional features have small receptive fields which contain partial details of targets while the base matchers from high-level convolutional features have big receptive fields which capture semantic information of targets. The model achieves the final strong matcher as a weighted ensemble of all the base matchers. We design an effective weights propagation strategy to update the weights of base matchers. Moreover, we propose to use Cosine as the distance metric and a customized squared-loss function as cost function for robust. Experiments show that our tracker outperforms the state-of-the-art trackers in a wide range of tracking scenarios.

Original language	English
Article number	8026140
Pages (from-to)	1562-1566
Number of pages	5
Journal	IEEE Signal Processing Letters
Volume	24
Issue number	10
DOIs	https://doi.org/10.1109/LSP.2017.2749458
Publication status	Published - Oct 2017

Keywords

Convolutional neural network (CNN)
Siamese neural network
ensemble tracking
template matching

Access to Document

10.1109/LSP.2017.2749458

Cite this

@article{355ce38fdfe746e2922338e454d1577f,

title = "Deep Ensemble Tracking",

abstract = "In this letter, we cast visual tracking as a template matching problem in a Siamese deep convolutional neural network architecture. In contrast to traditional or other deep feature-based tracking methods, the proposed model exploits multilevel convolutional features from a partial view. The model matches candidate patch and template patch from the feature dimension of convolutional features, leading to hundreds of thousands of base matchers. The base matchers from low-level convolutional features have small receptive fields which contain partial details of targets while the base matchers from high-level convolutional features have big receptive fields which capture semantic information of targets. The model achieves the final strong matcher as a weighted ensemble of all the base matchers. We design an effective weights propagation strategy to update the weights of base matchers. Moreover, we propose to use Cosine as the distance metric and a customized squared-loss function as cost function for robust. Experiments show that our tracker outperforms the state-of-the-art trackers in a wide range of tracking scenarios.",

keywords = "Convolutional neural network (CNN), Siamese neural network, ensemble tracking, template matching",

author = "Jie Guo and Tingfa Xu",

note = "Publisher Copyright: {\textcopyright} 1994-2012 IEEE.",

year = "2017",

month = oct,

doi = "10.1109/LSP.2017.2749458",

language = "English",

volume = "24",

pages = "1562--1566",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "10",

}

TY - JOUR

T1 - Deep Ensemble Tracking

AU - Guo, Jie

AU - Xu, Tingfa

PY - 2017/10

Y1 - 2017/10

N2 - In this letter, we cast visual tracking as a template matching problem in a Siamese deep convolutional neural network architecture. In contrast to traditional or other deep feature-based tracking methods, the proposed model exploits multilevel convolutional features from a partial view. The model matches candidate patch and template patch from the feature dimension of convolutional features, leading to hundreds of thousands of base matchers. The base matchers from low-level convolutional features have small receptive fields which contain partial details of targets while the base matchers from high-level convolutional features have big receptive fields which capture semantic information of targets. The model achieves the final strong matcher as a weighted ensemble of all the base matchers. We design an effective weights propagation strategy to update the weights of base matchers. Moreover, we propose to use Cosine as the distance metric and a customized squared-loss function as cost function for robust. Experiments show that our tracker outperforms the state-of-the-art trackers in a wide range of tracking scenarios.

AB - In this letter, we cast visual tracking as a template matching problem in a Siamese deep convolutional neural network architecture. In contrast to traditional or other deep feature-based tracking methods, the proposed model exploits multilevel convolutional features from a partial view. The model matches candidate patch and template patch from the feature dimension of convolutional features, leading to hundreds of thousands of base matchers. The base matchers from low-level convolutional features have small receptive fields which contain partial details of targets while the base matchers from high-level convolutional features have big receptive fields which capture semantic information of targets. The model achieves the final strong matcher as a weighted ensemble of all the base matchers. We design an effective weights propagation strategy to update the weights of base matchers. Moreover, we propose to use Cosine as the distance metric and a customized squared-loss function as cost function for robust. Experiments show that our tracker outperforms the state-of-the-art trackers in a wide range of tracking scenarios.

KW - Convolutional neural network (CNN)

KW - Siamese neural network

KW - ensemble tracking

KW - template matching

UR - http://www.scopus.com/inward/record.url?scp=85029168350&partnerID=8YFLogxK

U2 - 10.1109/LSP.2017.2749458

DO - 10.1109/LSP.2017.2749458

M3 - Article

AN - SCOPUS:85029168350

SN - 1070-9908

VL - 24

SP - 1562

EP - 1566

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

IS - 10

M1 - 8026140

ER -

Deep Ensemble Tracking

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this