TY - GEN
T1 - Visual tracking with kernelized correlation filters based on multiple features
AU - Cai, Zhi
AU - Dong, Liquan
AU - Liu, Ming
AU - Zhao, Yuejin
AU - Du, Haoyuan
AU - Yuan, Ruifeng
AU - Ma, Feilong
N1 - Publisher Copyright:
© 2018 SPIE.
PY - 2018
Y1 - 2018
N2 - Visual object tracking plays a significant role in our daily life such as intelligent transportation and surveillance. However, an accurate and robust object tracker is hard to be obtained as target objects often go through huge appearance changes caused by deformation, abrupt motion, background clutter and occlusion. In this paper, we combine features extracted from deep convolutional neural networks pretrained on object recognition datasets with color name features and histogram of oriented gradient features skillfully to improve tracking accuracy and robustness. The outputs of the convolutional layers encode the senior semantic information of targets and such representations are robust to great appearance variations while their spatial resolution is too coarse to precisely locate targets. In contrast, color name features connected at the back of HOG features could provide more precise localization but are less invariant to appearance changes. We first infer the response of the convolutional features and HOG-CN features respectively, then make a linear combination of them. The maximum value of the result could represent the accurate localization of the target. We not only compare the tracking results of adopting a single feature alone, showing that the performance of them is inferior to ours, but also analyze the effect of exploiting features extracted from different convolutional layers on the tracking performance. What's more, we introduce the adaptive target response map in our tracking algorithm to keep the target from drifting as much as possible. Extensive experimental results on a large scale benchmark dataset illustrates outstanding performance of the proposed algorithm.
AB - Visual object tracking plays a significant role in our daily life such as intelligent transportation and surveillance. However, an accurate and robust object tracker is hard to be obtained as target objects often go through huge appearance changes caused by deformation, abrupt motion, background clutter and occlusion. In this paper, we combine features extracted from deep convolutional neural networks pretrained on object recognition datasets with color name features and histogram of oriented gradient features skillfully to improve tracking accuracy and robustness. The outputs of the convolutional layers encode the senior semantic information of targets and such representations are robust to great appearance variations while their spatial resolution is too coarse to precisely locate targets. In contrast, color name features connected at the back of HOG features could provide more precise localization but are less invariant to appearance changes. We first infer the response of the convolutional features and HOG-CN features respectively, then make a linear combination of them. The maximum value of the result could represent the accurate localization of the target. We not only compare the tracking results of adopting a single feature alone, showing that the performance of them is inferior to ours, but also analyze the effect of exploiting features extracted from different convolutional layers on the tracking performance. What's more, we introduce the adaptive target response map in our tracking algorithm to keep the target from drifting as much as possible. Extensive experimental results on a large scale benchmark dataset illustrates outstanding performance of the proposed algorithm.
KW - adaptive target response
KW - convolutional features
KW - correlation filters
KW - multiple features
KW - visual tracking
UR - http://www.scopus.com/inward/record.url?scp=85054609227&partnerID=8YFLogxK
U2 - 10.1117/12.2319715
DO - 10.1117/12.2319715
M3 - Conference contribution
AN - SCOPUS:85054609227
SN - 9781510620735
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Optics and Photonics for Information Processing XII
A2 - Iftekharuddin, Khan M.
A2 - Diaz-Ramirez, Victor H.
A2 - Vazquez, Mireya Garcia
A2 - Awwal, Abdul A. S.
A2 - Marquez, Andres
PB - SPIE
T2 - Optics and Photonics for Information Processing XII 2018
Y2 - 19 August 2018 through 20 August 2018
ER -