Robust L1 tracker with CNN features

Hongqing Wang; Tingfa Xu

doi:10.1186/s13638-017-0982-4

Robust L1 tracker with CNN features

Hongqing Wang, Tingfa Xu^*

^*Corresponding author for this work

School of Optics and Photonics

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Recently, L1 tracker has been widely applied and received great success in visual tracking. However, most L1 trackers use only the image intensity for sparse representation, which is insufficient to represent the object especially when drastic appearance changes occur. Convolutional neural network (CNN) has demonstrated remarkable capability in a wide range of computer vision fields, and features extracted from different convolutional layers have different characteristics. In this paper, we propose a novel sparse representation model with convolutional features for visual tracking. Besides, to alleviate the redundancy from high-dimensional convolutional features, a feature selection method is adopted to remove noisy and irrelevant feature maps, which can reduce computation redundancy and improve tracking accuracy. Different from traditional sparse representation based tracking methods, our model not only exploits convolutional features to improve the robustness for describing the object appearance but also uses the trivial templates to model both reconstruction errors caused by sparse representation and the eigen-subspace representation. In addition, an unified objective function is proposed and a customized APG method is developed to effectively solve the optimization problem. Numerous qualitative and quantitative evaluations demonstrate that our tracker outperforms other state-of-the-art trackers in a wide range of tracking scenarios.

Original language	English
Article number	194
Journal	Eurasip Journal on Wireless Communications and Networking
Volume	2017
Issue number	1
DOIs	https://doi.org/10.1186/s13638-017-0982-4
Publication status	Published - 1 Dec 2017

Keywords

APG method
CNN features
Sparse representation
Visual tracking

Access to Document

10.1186/s13638-017-0982-4

Cite this

Wang, H., & Xu, T. (2017). Robust L1 tracker with CNN features. Eurasip Journal on Wireless Communications and Networking, 2017(1), Article 194. https://doi.org/10.1186/s13638-017-0982-4

@article{bff20df02bc24355b3584d7d686be785,

title = "Robust L1 tracker with CNN features",

abstract = "Recently, L1 tracker has been widely applied and received great success in visual tracking. However, most L1 trackers use only the image intensity for sparse representation, which is insufficient to represent the object especially when drastic appearance changes occur. Convolutional neural network (CNN) has demonstrated remarkable capability in a wide range of computer vision fields, and features extracted from different convolutional layers have different characteristics. In this paper, we propose a novel sparse representation model with convolutional features for visual tracking. Besides, to alleviate the redundancy from high-dimensional convolutional features, a feature selection method is adopted to remove noisy and irrelevant feature maps, which can reduce computation redundancy and improve tracking accuracy. Different from traditional sparse representation based tracking methods, our model not only exploits convolutional features to improve the robustness for describing the object appearance but also uses the trivial templates to model both reconstruction errors caused by sparse representation and the eigen-subspace representation. In addition, an unified objective function is proposed and a customized APG method is developed to effectively solve the optimization problem. Numerous qualitative and quantitative evaluations demonstrate that our tracker outperforms other state-of-the-art trackers in a wide range of tracking scenarios.",

keywords = "APG method, CNN features, Sparse representation, Visual tracking",

author = "Hongqing Wang and Tingfa Xu",

note = "Publisher Copyright: {\textcopyright} 2017, The Author(s).",

year = "2017",

month = dec,

day = "1",

doi = "10.1186/s13638-017-0982-4",

language = "English",

volume = "2017",

journal = "Eurasip Journal on Wireless Communications and Networking",

issn = "1687-1472",

publisher = "SpringerOpen",

number = "1",

}

TY - JOUR

T1 - Robust L1 tracker with CNN features

AU - Wang, Hongqing

AU - Xu, Tingfa

PY - 2017/12/1

Y1 - 2017/12/1

N2 - Recently, L1 tracker has been widely applied and received great success in visual tracking. However, most L1 trackers use only the image intensity for sparse representation, which is insufficient to represent the object especially when drastic appearance changes occur. Convolutional neural network (CNN) has demonstrated remarkable capability in a wide range of computer vision fields, and features extracted from different convolutional layers have different characteristics. In this paper, we propose a novel sparse representation model with convolutional features for visual tracking. Besides, to alleviate the redundancy from high-dimensional convolutional features, a feature selection method is adopted to remove noisy and irrelevant feature maps, which can reduce computation redundancy and improve tracking accuracy. Different from traditional sparse representation based tracking methods, our model not only exploits convolutional features to improve the robustness for describing the object appearance but also uses the trivial templates to model both reconstruction errors caused by sparse representation and the eigen-subspace representation. In addition, an unified objective function is proposed and a customized APG method is developed to effectively solve the optimization problem. Numerous qualitative and quantitative evaluations demonstrate that our tracker outperforms other state-of-the-art trackers in a wide range of tracking scenarios.

AB - Recently, L1 tracker has been widely applied and received great success in visual tracking. However, most L1 trackers use only the image intensity for sparse representation, which is insufficient to represent the object especially when drastic appearance changes occur. Convolutional neural network (CNN) has demonstrated remarkable capability in a wide range of computer vision fields, and features extracted from different convolutional layers have different characteristics. In this paper, we propose a novel sparse representation model with convolutional features for visual tracking. Besides, to alleviate the redundancy from high-dimensional convolutional features, a feature selection method is adopted to remove noisy and irrelevant feature maps, which can reduce computation redundancy and improve tracking accuracy. Different from traditional sparse representation based tracking methods, our model not only exploits convolutional features to improve the robustness for describing the object appearance but also uses the trivial templates to model both reconstruction errors caused by sparse representation and the eigen-subspace representation. In addition, an unified objective function is proposed and a customized APG method is developed to effectively solve the optimization problem. Numerous qualitative and quantitative evaluations demonstrate that our tracker outperforms other state-of-the-art trackers in a wide range of tracking scenarios.

KW - APG method

KW - CNN features

KW - Sparse representation

KW - Visual tracking

UR - http://www.scopus.com/inward/record.url?scp=85035233964&partnerID=8YFLogxK

U2 - 10.1186/s13638-017-0982-4

DO - 10.1186/s13638-017-0982-4

M3 - Article

AN - SCOPUS:85035233964

SN - 1687-1472

VL - 2017

JO - Eurasip Journal on Wireless Communications and Networking

JF - Eurasip Journal on Wireless Communications and Networking

IS - 1

M1 - 194

ER -

Robust L1 tracker with CNN features

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this