Online visual tracking with high-order pooling

Xiyu Yan; Bo Ma

doi:10.1109/ICME.2017.8019349

Online visual tracking with high-order pooling

Xiyu Yan, Bo Ma

计算机学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Most local sparse representation models in visual tracking generally contain three components: 1) extracting local descriptors from target region, 2) encoding the extracted local descriptors as mid-level features, 3) aggregating statistics of mid-level features into a signature. Since the last step aggregates only first-order statistics of mid-level features, it is named as First-order Pooling (FP). However, FP lacks highorder statistical information of target. Hence, it couldn't reflect the correlation of features, which leads to poor tracking performance. In this paper, we introduce an appearance model for visual tracking that conducts High-order Pooling (HP) over mid-level features under the framework of sparse coding. Instead of first-order signature, we find that higher-order statistics of mid-level features with additional image information could bring large tracking performance gains. Moreover, a simple but effective updating scheme is adopted to improve the tracker adaptability. Experiments on various challenging videos show that the tracking performance with appearance model using HP is superior to those using FP.

源语言	英语
主期刊名	2017 IEEE International Conference on Multimedia and Expo, ICME 2017
出版商	IEEE Computer Society
页	289-294
页数	6
ISBN（电子版）	9781509060672
DOI	https://doi.org/10.1109/ICME.2017.8019349
出版状态	已出版 - 28 8月 2017
活动	2017 IEEE International Conference on Multimedia and Expo, ICME 2017 - Hong Kong, 香港期限: 10 7月 2017 → 14 7月 2017

出版系列

姓名	Proceedings - IEEE International Conference on Multimedia and Expo
ISSN（印刷版）	1945-7871
ISSN（电子版）	1945-788X

会议

会议	2017 IEEE International Conference on Multimedia and Expo, ICME 2017
国家/地区	香港
市	Hong Kong
时期	10/07/17 → 14/07/17

访问文件

10.1109/ICME.2017.8019349

其它文件与链接

链接到 Scopus 的出版物

引用此

Yan, X., & Ma, B. (2017). Online visual tracking with high-order pooling. 在 2017 IEEE International Conference on Multimedia and Expo, ICME 2017 (页码 289-294). 文章 8019349 (Proceedings - IEEE International Conference on Multimedia and Expo). IEEE Computer Society. https://doi.org/10.1109/ICME.2017.8019349

@inproceedings{b9fd5dcc912a425489aa6f8d88ab5fe7,

title = "Online visual tracking with high-order pooling",

abstract = "Most local sparse representation models in visual tracking generally contain three components: 1) extracting local descriptors from target region, 2) encoding the extracted local descriptors as mid-level features, 3) aggregating statistics of mid-level features into a signature. Since the last step aggregates only first-order statistics of mid-level features, it is named as First-order Pooling (FP). However, FP lacks highorder statistical information of target. Hence, it couldn't reflect the correlation of features, which leads to poor tracking performance. In this paper, we introduce an appearance model for visual tracking that conducts High-order Pooling (HP) over mid-level features under the framework of sparse coding. Instead of first-order signature, we find that higher-order statistics of mid-level features with additional image information could bring large tracking performance gains. Moreover, a simple but effective updating scheme is adopted to improve the tracker adaptability. Experiments on various challenging videos show that the tracking performance with appearance model using HP is superior to those using FP.",

keywords = "High-order Pooling, Mid-level features, Object tracking, Sparse coding",

author = "Xiyu Yan and Bo Ma",

note = "Publisher Copyright: {\textcopyright} 2017 IEEE.; 2017 IEEE International Conference on Multimedia and Expo, ICME 2017 ; Conference date: 10-07-2017 Through 14-07-2017",

year = "2017",

month = aug,

day = "28",

doi = "10.1109/ICME.2017.8019349",

language = "English",

series = "Proceedings - IEEE International Conference on Multimedia and Expo",

publisher = "IEEE Computer Society",

pages = "289--294",

booktitle = "2017 IEEE International Conference on Multimedia and Expo, ICME 2017",

address = "United States",

}

Yan, X & Ma, B 2017, Online visual tracking with high-order pooling. 在 2017 IEEE International Conference on Multimedia and Expo, ICME 2017., 8019349, Proceedings - IEEE International Conference on Multimedia and Expo, IEEE Computer Society, 页码 289-294, 2017 IEEE International Conference on Multimedia and Expo, ICME 2017, Hong Kong, 香港, 10/07/17. https://doi.org/10.1109/ICME.2017.8019349

TY - GEN

T1 - Online visual tracking with high-order pooling

AU - Yan, Xiyu

AU - Ma, Bo

PY - 2017/8/28

Y1 - 2017/8/28

N2 - Most local sparse representation models in visual tracking generally contain three components: 1) extracting local descriptors from target region, 2) encoding the extracted local descriptors as mid-level features, 3) aggregating statistics of mid-level features into a signature. Since the last step aggregates only first-order statistics of mid-level features, it is named as First-order Pooling (FP). However, FP lacks highorder statistical information of target. Hence, it couldn't reflect the correlation of features, which leads to poor tracking performance. In this paper, we introduce an appearance model for visual tracking that conducts High-order Pooling (HP) over mid-level features under the framework of sparse coding. Instead of first-order signature, we find that higher-order statistics of mid-level features with additional image information could bring large tracking performance gains. Moreover, a simple but effective updating scheme is adopted to improve the tracker adaptability. Experiments on various challenging videos show that the tracking performance with appearance model using HP is superior to those using FP.

AB - Most local sparse representation models in visual tracking generally contain three components: 1) extracting local descriptors from target region, 2) encoding the extracted local descriptors as mid-level features, 3) aggregating statistics of mid-level features into a signature. Since the last step aggregates only first-order statistics of mid-level features, it is named as First-order Pooling (FP). However, FP lacks highorder statistical information of target. Hence, it couldn't reflect the correlation of features, which leads to poor tracking performance. In this paper, we introduce an appearance model for visual tracking that conducts High-order Pooling (HP) over mid-level features under the framework of sparse coding. Instead of first-order signature, we find that higher-order statistics of mid-level features with additional image information could bring large tracking performance gains. Moreover, a simple but effective updating scheme is adopted to improve the tracker adaptability. Experiments on various challenging videos show that the tracking performance with appearance model using HP is superior to those using FP.

KW - High-order Pooling

KW - Mid-level features

KW - Object tracking

KW - Sparse coding

UR - http://www.scopus.com/inward/record.url?scp=85030214933&partnerID=8YFLogxK

U2 - 10.1109/ICME.2017.8019349

DO - 10.1109/ICME.2017.8019349

M3 - Conference contribution

AN - SCOPUS:85030214933

T3 - Proceedings - IEEE International Conference on Multimedia and Expo

SP - 289

EP - 294

BT - 2017 IEEE International Conference on Multimedia and Expo, ICME 2017

PB - IEEE Computer Society

T2 - 2017 IEEE International Conference on Multimedia and Expo, ICME 2017

Y2 - 10 July 2017 through 14 July 2017

ER -

Online visual tracking with high-order pooling

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此