Partitioned k-means clustering for fast construction of unbiased visual vocabulary

Shikui Wei; Xinxiao Wu; Dong Xu

doi:10.1007/978-1-4614-3501-3_40

Shikui Wei^*, Xinxiao Wu, Dong Xu

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

6 Citations (Scopus)

Abstract

The Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search, where a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform flat or hierarchical clustering on a very large training set in their original description space. However, these methods usually suffer from two issues: (1) a large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) the generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Subsequently, we build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.
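
The subspace idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how the space is split or how centroids are combined, so the contiguous split of dimensions, the Cartesian-product combination of per-subspace centroids, the mixed-radix word index, and all function names (`kmeans`, `partitioned_kmeans`, `quantize`) are assumptions made here for illustration only. The data are random numbers, not real image descriptors.

```python
import numpy as np


def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns a (k, d) array of centroids."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct training points.
    centroids = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iter):
        # Squared Euclidean distance from every point to every centroid: (n, k).
        d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members):  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids


def partitioned_kmeans(X, n_subspaces, k_per_subspace, seed=0):
    """Assumed scheme: split dimensions into contiguous blocks and
    run an independent k-means in each block."""
    blocks = np.array_split(np.arange(X.shape[1]), n_subspaces)
    return [
        (dims, kmeans(X[:, dims], k_per_subspace, seed=seed + i))
        for i, dims in enumerate(blocks)
    ]


def quantize(x, codebooks):
    """Map one descriptor to a visual-word index by combining the nearest
    centroid index from each subspace (mixed-radix encoding, an assumption)."""
    word = 0
    for dims, centroids in codebooks:
        idx = int(((centroids - x[dims]) ** 2).sum(axis=1).argmin())
        word = word * len(centroids) + idx
    return word


# Synthetic stand-in for 128-dimensional local descriptors.
rng = np.random.default_rng(0)
descriptors = rng.random((500, 128))

codebooks = partitioned_kmeans(descriptors, n_subspaces=4, k_per_subspace=8)
vocab_size = 8 ** 4  # implied vocabulary size under the product assumption
word = quantize(descriptors[0], codebooks)
```

Under these assumptions, only 4 x 8 = 32 centroids are learned, yet the combined vocabulary has 8^4 = 4096 words, which is one way a small training set could yield a large vocabulary.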

Original language	English
Title of host publication	The Era of Interactive Media
Publisher	Springer New York
Pages	483-493
Number of pages	11
Volume	9781461435013
ISBN (Electronic)	9781461435013
ISBN (Print)	1461435005, 9781461435006
DOIs	https://doi.org/10.1007/978-1-4614-3501-3_40
Publication status	Published - 1 Oct 2013
Externally published	Yes

Keywords

BoW
Image retrieval
Partitioned K-means clustering

Access to Document

10.1007/978-1-4614-3501-3_40

Cite this

@inbook{ceb92df6c55b453eadc96d8c4461cd6a,

title = "Partitioned k-means clustering for fast construction of unbiased visual vocabulary",

abstract = "The Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search, where a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform flat or hierarchical clustering on a very large training set in their original description space. However, these methods usually suffer from two issues: (1) a large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) the generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Subsequently, we build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.",

keywords = "BoW, Image retrieval, Partitioned K-means clustering",

author = "Shikui Wei and Xinxiao Wu and Dong Xu",

year = "2013",

month = oct,

day = "1",

doi = "10.1007/978-1-4614-3501-3_40",

language = "English",

isbn = "1461435005",

volume = "9781461435013",

pages = "483--493",

booktitle = "The Era of Interactive Media",

publisher = "Springer New York",

address = "United States",

}

TY - CHAP

T1 - Partitioned k-means clustering for fast construction of unbiased visual vocabulary

AU - Wei, Shikui

AU - Wu, Xinxiao

AU - Xu, Dong

PY - 2013/10/1

Y1 - 2013/10/1

AB - The Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search, where a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform flat or hierarchical clustering on a very large training set in their original description space. However, these methods usually suffer from two issues: (1) a large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) the generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Subsequently, we build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.

KW - BoW

KW - Image retrieval

KW - Partitioned K-means clustering

UR - http://www.scopus.com/inward/record.url?scp=84890032488&partnerID=8YFLogxK

U2 - 10.1007/978-1-4614-3501-3_40

DO - 10.1007/978-1-4614-3501-3_40

M3 - Chapter

AN - SCOPUS:84890032488

SN - 1461435005

SN - 9781461435006

VL - 9781461435013

SP - 483

EP - 493

BT - The Era of Interactive Media

PB - Springer New York

ER -
