Generalized pooling pyramid with hierarchical dictionary sparse coding for event and object recognition

Shuai Chen, Bo Ma*, Pei Luo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Feature coding and vector pooling are essential for image recognition in bag-of-visual-words (BoW) method. Encoding the low-level feature to rich one and pooling it without any information loss are very challenging works. In this paper, generalized pooling pyramid with hierarchical dictionary sparse coding is introduced to get rich sparse codes and alleviate the information loss in the phase of pooling. It includes two modules: First, with the low-level feature, hierarchical dictionary is learned for sparse coding to generate the hierarchical sparse representation. Second, in the phase of vector pooling, we present generalized pooling pyramid by utilizing the probabilistic function to model the statistical distribution of sparse codes. In the generalized pooling pyramid, the Fisher vectors which are computed with Gaussian Mixture (GMM) in different levels, are fused to represent the images. The performance of our method outperforms state-of-the-art performance in a large number of image categorization experiments on the event dataset (UIUC-Sport dataset) and the object recognition dataset (Caltech101 dataset).

Original languageEnglish
Title of host publication2017 IEEE International Conference on Image Processing, ICIP 2017 - Proceedings
PublisherIEEE Computer Society
Pages2349-2353
Number of pages5
ISBN (Electronic)9781509021758
DOIs
Publication statusPublished - 2 Jul 2017
Event24th IEEE International Conference on Image Processing, ICIP 2017 - Beijing, China
Duration: 17 Sept 201720 Sept 2017

Publication series

NameProceedings - International Conference on Image Processing, ICIP
Volume2017-September
ISSN (Print)1522-4880

Conference

Conference24th IEEE International Conference on Image Processing, ICIP 2017
Country/TerritoryChina
CityBeijing
Period17/09/1720/09/17

Fingerprint

Dive into the research topics of 'Generalized pooling pyramid with hierarchical dictionary sparse coding for event and object recognition'. Together they form a unique fingerprint.

Cite this