Visual word soft-histogram for image representation

Yan Jie Wang; Xia Bi Liu; Yun De Jia

doi:10.3724/SP.J.1001.2012.04082

Visual word soft-histogram for image representation

Yan Jie Wang, Xia Bi Liu^*, Yun De Jia

^*Corresponding author for this work

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.

Original language	English
Pages (from-to)	1787-1795
Number of pages	9
Journal	Ruan Jian Xue Bao/Journal of Software
Volume	23
Issue number	7
DOIs	https://doi.org/10.3724/SP.J.1001.2012.04082
Publication status	Published - Jul 2012

Keywords

Discriminative learning
Gaussian mixture model
Image representation
Soft-histogram
Visual word

Access to Document

10.3724/SP.J.1001.2012.04082

Cite this

@article{8cfada27f9dc4d8a9498cfac0be27577,

title = "Visual word soft-histogram for image representation",

abstract = "This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.",

keywords = "Discriminative learning, Gaussian mixture model, Image representation, Soft-histogram, Visual word",

author = "Wang, {Yan Jie} and Liu, {Xia Bi} and Jia, {Yun De}",

year = "2012",

month = jul,

doi = "10.3724/SP.J.1001.2012.04082",

language = "English",

volume = "23",

pages = "1787--1795",

journal = "Ruan Jian Xue Bao/Journal of Software",

issn = "1000-9825",

publisher = "Chinese Academy of Sciences",

number = "7",

}

TY - JOUR

T1 - Visual word soft-histogram for image representation

AU - Wang, Yan Jie

AU - Liu, Xia Bi

AU - Jia, Yun De

PY - 2012/7

Y1 - 2012/7

N2 - This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.

AB - This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.

KW - Discriminative learning

KW - Gaussian mixture model

KW - Image representation

KW - Soft-histogram

KW - Visual word

UR - http://www.scopus.com/inward/record.url?scp=84865142508&partnerID=8YFLogxK

U2 - 10.3724/SP.J.1001.2012.04082

DO - 10.3724/SP.J.1001.2012.04082

M3 - Article

AN - SCOPUS:84865142508

SN - 1000-9825

VL - 23

SP - 1787

EP - 1795

JO - Ruan Jian Xue Bao/Journal of Software

JF - Ruan Jian Xue Bao/Journal of Software

IS - 7

ER -

Visual word soft-histogram for image representation

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this