Partitioned k-means clustering for fast construction of unbiased visual vocabulary

Shikui Wei*, Xinxiao Wu, Dong Xu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节章节同行评审

6 引用 (Scopus)

摘要

Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search area, in which a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform a flat or hierarchical clustering operation using a very large training set in their original description space. However, these methods usually suffer from two issues: (1) A large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) The generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Sequentially, we can build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.

源语言英语
主期刊名The Era of Interactive Media
出版商Springer New York
483-493
页数11
9781461435013
ISBN(电子版)9781461435013
ISBN(印刷版)1461435005, 9781461435006
DOI
出版状态已出版 - 1 10月 2013
已对外发布

指纹

探究 'Partitioned k-means clustering for fast construction of unbiased visual vocabulary' 的科研主题。它们共同构成独一无二的指纹。

引用此