Discriminative tracking using tensor pooling

Bo Ma; Lianghua Huang; Jianbing Shen; Ling Shao

doi:10.1109/TCYB.2015.2477879

Discriminative tracking using tensor pooling

Bo Ma, Lianghua Huang, Jianbing Shen^*, Ling Shao

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

55 Citations (Scopus)

Abstract

How to effectively organize local descriptors to build a global representation has a critical impact on the performance of vision tasks. Recently, local sparse representation has been successfully applied to visual tracking, owing to its discriminative nature and robustness against local noise and partial occlusions. Local sparse codes computed with a template actually form a three-order tensor according to their original layout, although most existing pooling operators convert the codes to a vector by concatenating or computing statistics on them. We argue that, compared to pooling vectors, the tensor form could deliver more intrinsic structural information for the target appearance, and can also avoid high dimensionality learning problems suffered in concatenation-based pooling methods. Therefore, in this paper, we propose to represent target templates and candidates directly with sparse coding tensors, and build the appearance model by incrementally learning on these tensors. We propose a discriminative framework to further improve robustness of our method against drifting and environmental noise. Experiments on a recent comprehensive benchmark indicate that our method performs better than state-of-the-art trackers.

Original language	English
Article number	2477879
Pages (from-to)	2411-2422
Number of pages	12
Journal	IEEE Transactions on Cybernetics
Volume	46
Issue number	10
DOIs	https://doi.org/10.1109/TCYB.2015.2477879
Publication status	Published - 28 Sept 2015

Keywords

Discriminative
Sparse representation
Subspace
Tensor pooling
Tracking

Access to Document

10.1109/TCYB.2015.2477879

Cite this

@article{0e803d88e1174bd9b84ccaa69f211ae4,

title = "Discriminative tracking using tensor pooling",

abstract = "How to effectively organize local descriptors to build a global representation has a critical impact on the performance of vision tasks. Recently, local sparse representation has been successfully applied to visual tracking, owing to its discriminative nature and robustness against local noise and partial occlusions. Local sparse codes computed with a template actually form a three-order tensor according to their original layout, although most existing pooling operators convert the codes to a vector by concatenating or computing statistics on them. We argue that, compared to pooling vectors, the tensor form could deliver more intrinsic structural information for the target appearance, and can also avoid high dimensionality learning problems suffered in concatenation-based pooling methods. Therefore, in this paper, we propose to represent target templates and candidates directly with sparse coding tensors, and build the appearance model by incrementally learning on these tensors. We propose a discriminative framework to further improve robustness of our method against drifting and environmental noise. Experiments on a recent comprehensive benchmark indicate that our method performs better than state-of-the-art trackers.",

keywords = "Discriminative, Sparse representation, Subspace, Tensor pooling, Tracking",

author = "Bo Ma and Lianghua Huang and Jianbing Shen and Ling Shao",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.",

year = "2015",

month = sep,

day = "28",

doi = "10.1109/TCYB.2015.2477879",

language = "English",

volume = "46",

pages = "2411--2422",

journal = "IEEE Transactions on Cybernetics",

issn = "2168-2267",

publisher = "IEEE Advancing Technology for Humanity",

number = "10",

}

TY - JOUR

T1 - Discriminative tracking using tensor pooling

AU - Ma, Bo

AU - Huang, Lianghua

AU - Shen, Jianbing

AU - Shao, Ling

PY - 2015/9/28

Y1 - 2015/9/28

N2 - How to effectively organize local descriptors to build a global representation has a critical impact on the performance of vision tasks. Recently, local sparse representation has been successfully applied to visual tracking, owing to its discriminative nature and robustness against local noise and partial occlusions. Local sparse codes computed with a template actually form a three-order tensor according to their original layout, although most existing pooling operators convert the codes to a vector by concatenating or computing statistics on them. We argue that, compared to pooling vectors, the tensor form could deliver more intrinsic structural information for the target appearance, and can also avoid high dimensionality learning problems suffered in concatenation-based pooling methods. Therefore, in this paper, we propose to represent target templates and candidates directly with sparse coding tensors, and build the appearance model by incrementally learning on these tensors. We propose a discriminative framework to further improve robustness of our method against drifting and environmental noise. Experiments on a recent comprehensive benchmark indicate that our method performs better than state-of-the-art trackers.

AB - How to effectively organize local descriptors to build a global representation has a critical impact on the performance of vision tasks. Recently, local sparse representation has been successfully applied to visual tracking, owing to its discriminative nature and robustness against local noise and partial occlusions. Local sparse codes computed with a template actually form a three-order tensor according to their original layout, although most existing pooling operators convert the codes to a vector by concatenating or computing statistics on them. We argue that, compared to pooling vectors, the tensor form could deliver more intrinsic structural information for the target appearance, and can also avoid high dimensionality learning problems suffered in concatenation-based pooling methods. Therefore, in this paper, we propose to represent target templates and candidates directly with sparse coding tensors, and build the appearance model by incrementally learning on these tensors. We propose a discriminative framework to further improve robustness of our method against drifting and environmental noise. Experiments on a recent comprehensive benchmark indicate that our method performs better than state-of-the-art trackers.

KW - Discriminative

KW - Sparse representation

KW - Subspace

KW - Tensor pooling

KW - Tracking

UR - http://www.scopus.com/inward/record.url?scp=84943172349&partnerID=8YFLogxK

U2 - 10.1109/TCYB.2015.2477879

DO - 10.1109/TCYB.2015.2477879

M3 - Article

AN - SCOPUS:84943172349

SN - 2168-2267

VL - 46

SP - 2411

EP - 2422

JO - IEEE Transactions on Cybernetics

JF - IEEE Transactions on Cybernetics

IS - 10

M1 - 2477879

ER -

Discriminative tracking using tensor pooling

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this