Tensor pooling for online visual tracking

Lianghua Huang, Bo Ma

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

Recently, local sparse representation (LSR) has been successfully applied in visual tracking, owing to its discriminative nature and robustness against local noise and occlusions. It is note worthy that local sparse codes computed with a template form a 3-order tensor of their original layout, although most pooling operators convert it to a vector by concatenating or computing statistics on it. As compared to pooling vectors, tensor form could deliver more informative and structured representation for target appearance, and can also avoid high dimensionality learning problem suffered in concatenating pooling based methods. Motivated by above ideas, in this paper, we propose to represent target templates directly with sparse coding tensors, and build the appearance model by incrementally learning on these tensors. We further propose a discriminative framework to improve robustness against drifting and environment noise. Experiments on a recent comprehensive benchmark indicate that our method outperforms state-of-the-art trackers.

Original languageEnglish
Title of host publication2015 IEEE International Conference on Multimedia and Expo, ICME 2015
PublisherIEEE Computer Society
ISBN (Electronic)9781479970827
DOIs
Publication statusPublished - 4 Aug 2015
EventIEEE International Conference on Multimedia and Expo, ICME 2015 - Turin, Italy
Duration: 29 Jun 20153 Jul 2015

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume2015-August
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

ConferenceIEEE International Conference on Multimedia and Expo, ICME 2015
Country/TerritoryItaly
CityTurin
Period29/06/153/07/15

Keywords

  • Tracking
  • sparse representation
  • tensor Pooling
  • tensor subspace learning

Fingerprint

Dive into the research topics of 'Tensor pooling for online visual tracking'. Together they form a unique fingerprint.

Cite this