Visual word soft-histogram for image representation

Yan Jie Wang; Xia Bi Liu; Yun De Jia

doi:10.3724/SP.J.1001.2012.04082

Yan Jie Wang, Xia Bi Liu^*, Yun De Jia

^*Corresponding author for this work

School of Computer Science

Research output: Contribution to journal › Article › Peer-reviewed

3 Citations (Scopus)

Abstract

This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. The method uses Gaussian mixture models (GMMs) to capture the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate the GMMs of the visual words. The similarities between each visual word and the corresponding local features are computed, summed, and normalized to construct a soft-histogram. The paper also discusses the implementation of two representation methods. The first is the classification-based soft histogram, in which each local feature is assigned only to the visual word with maximum similarity. The second is the completely soft histogram, in which each local feature is assigned to all the visual words. Experimental results on Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of the method.
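The two histogram variants described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the similarity matrix `sim` (one row per local feature, one column per visual word) is assumed to be given and non-empty, whereas the paper derives similarities from discriminatively trained GMMs. The function names are hypothetical, and adding the similarity value (rather than a count of 1) in the classification-based variant is an assumption drawn from the "computed, summed, and normalized" wording.

```python
from typing import List, Sequence


def _normalize(hist: List[float]) -> List[float]:
    """Scale a histogram so its bins sum to 1 (unchanged if all bins are 0)."""
    total = sum(hist)
    return [h / total for h in hist] if total > 0 else hist


def completely_soft_histogram(sim: Sequence[Sequence[float]]) -> List[float]:
    """Every local feature adds its similarity to every visual word."""
    n_words = len(sim[0])
    hist = [0.0] * n_words
    for row in sim:
        for k, s in enumerate(row):
            hist[k] += s
    return _normalize(hist)


def classification_based_soft_histogram(sim: Sequence[Sequence[float]]) -> List[float]:
    """Each local feature adds its similarity only to its most similar word."""
    n_words = len(sim[0])
    hist = [0.0] * n_words
    for row in sim:
        # Index of the visual word with maximum similarity for this feature.
        best = max(range(n_words), key=lambda k: row[k])
        hist[best] += row[best]  # assumption: add the similarity, not a count
    return _normalize(hist)


if __name__ == "__main__":
    # Toy similarities: 3 local features (rows) x 3 visual words (columns).
    sim = [
        [0.5, 0.25, 0.25],
        [0.25, 0.5, 0.25],
        [0.0, 0.25, 0.75],
    ]
    print(completely_soft_histogram(sim))
    print(classification_based_soft_histogram(sim))
```

In the completely soft variant every bin receives mass from every feature, so the histogram reflects ambiguity between similar visual words; the classification-based variant keeps only each feature's best match, which is closer to a conventional bag-of-words histogram weighted by match quality.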

Original language	English
Pages (from-to)	1787-1795
Number of pages	9
Journal	Ruan Jian Xue Bao/Journal of Software
Volume	23
Issue number	7
DOI	https://doi.org/10.3724/SP.J.1001.2012.04082
Publication status	Published - Jul 2012


Cite this

@article{8cfada27f9dc4d8a9498cfac0be27577,
  title = "Visual word soft-histogram for image representation",
  abstract = "This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.",
  keywords = "Discriminative learning, Gaussian mixture model, Image representation, Soft-histogram, Visual word",
  author = "Wang, {Yan Jie} and Liu, {Xia Bi} and Jia, {Yun De}",
  year = "2012",
  month = jul,
  doi = "10.3724/SP.J.1001.2012.04082",
  language = "English",
  volume = "23",
  pages = "1787--1795",
  journal = "Ruan Jian Xue Bao/Journal of Software",
  issn = "1000-9825",
  publisher = "Chinese Academy of Sciences",
  number = "7",
}

TY  - JOUR
T1  - Visual word soft-histogram for image representation
AU  - Wang, Yan Jie
AU  - Liu, Xia Bi
AU  - Jia, Yun De
PY  - 2012/7
Y1  - 2012/7
N2  - This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.
AB  - This paper proposes a visual word soft-histogram for image representation based on statistical modeling and discriminative learning of visual words. This type of learning uses Gaussian mixture models (GMM) to reflect the appearance variation of each visual word and employs the max-min posterior pseudo-probabilities discriminative learning method to estimate GMMs of visual words. The similarities between each visual word and corresponding local features are computed, summed, and normalized to construct a soft-histogram. This paper also discusses the implementation of two representation methods. The first one is called classification-based soft histogram, in which each local feature is assigned to only one visual word with maximum similarity. The second one is called completely soft histogram, in which each local feature is assigned to all the visual words. The experimental results of Caltech-4 and PASCAL VOC 2006 confirm the effectiveness of this method.
KW  - Discriminative learning
KW  - Gaussian mixture model
KW  - Image representation
KW  - Soft-histogram
KW  - Visual word
UR  - http://www.scopus.com/inward/record.url?scp=84865142508&partnerID=8YFLogxK
U2  - 10.3724/SP.J.1001.2012.04082
DO  - 10.3724/SP.J.1001.2012.04082
M3  - Article
AN  - SCOPUS:84865142508
SN  - 1000-9825
VL  - 23
SP  - 1787
EP  - 1795
JO  - Ruan Jian Xue Bao/Journal of Software
JF  - Ruan Jian Xue Bao/Journal of Software
IS  - 7
ER  - 
