Partitioned k-means clustering for fast construction of unbiased visual vocabulary

Shikui Wei; Xinxiao Wu; Dong Xu

doi:10.1007/978-1-4614-3501-3_40

Shikui Wei^*, Xinxiao Wu, Dong Xu

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Chapter › peer-review

6 Citations (Scopus)

Abstract

The Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search, where a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform flat or hierarchical clustering on a very large training set in the original description space. However, these methods usually suffer from two issues: (1) a large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) the generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Subsequently, we build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.
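
The partitioning idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the space is split or how centroids are combined, so this sketch assumes contiguous, equal-width blocks of dimensions and a Cartesian-product combination (in the spirit of product quantization). The function names `kmeans`, `train_pkm`, and `quantize`, the random stand-in data, and all parameter values are hypothetical and are not taken from the chapter.

```python
import numpy as np


def kmeans(x, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns a (k, d) array of centroids."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct training points.
    centroids = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distance from every point to every centroid: shape (n, k).
        dists = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = x[labels == j]
            if len(members):  # an empty cluster keeps its previous centroid
                centroids[j] = members.mean(axis=0)
    return centroids


def train_pkm(descriptors, n_subspaces, k):
    """Run a separate k-means in each subspace.

    Assumption: subspaces are contiguous, equal-width blocks of dimensions.
    """
    blocks = np.split(descriptors, n_subspaces, axis=1)
    return [kmeans(block, k, seed=i) for i, block in enumerate(blocks)]


def quantize(descriptor, codebooks):
    """Map one descriptor to a word id in the implicit vocabulary of k**m words.

    Assumption: a visual word is one centroid choice per subspace.
    """
    k = len(codebooks[0])
    parts = np.split(descriptor, len(codebooks))
    word = 0
    for part, centroids in zip(parts, codebooks):
        nearest = int(((centroids - part) ** 2).sum(axis=1).argmin())
        word = word * k + nearest  # mixed-radix index over the subspaces
    return word


rng = np.random.default_rng(0)
train = rng.random((500, 128)).astype(np.float32)  # stand-in for 128-D descriptors
codebooks = train_pkm(train, n_subspaces=4, k=16)
vocab_size = 16 ** 4  # number of centroid combinations across 4 subspaces
word_id = quantize(train[0], codebooks)
```

Under these assumptions, only 4 × 16 = 64 sub-centroids are stored, yet every combination of one centroid per subspace counts as a word, which is one way a small training set could yield a large vocabulary.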

Original language	English
Title of host publication	The Era of Interactive Media
Publisher	Springer New York
Pages	483-493
Number of pages	11
Volume	9781461435013
ISBN (Electronic)	9781461435013
ISBN (Print)	1461435005, 9781461435006
DOI	https://doi.org/10.1007/978-1-4614-3501-3_40
Publication status	Published - 1 Oct 2013
Externally published	Yes

Cite this

@inbook{ceb92df6c55b453eadc96d8c4461cd6a,

title = "Partitioned k-means clustering for fast construction of unbiased visual vocabulary",

abstract = "Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search area, in which a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform a flat or hierarchical clustering operation using a very large training set in their original description space. However, these methods usually suffer from two issues: (1) A large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) The generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Sequentially, we can build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.",

keywords = "BoW, Image retrieval, Partitioned K-means clustering",

author = "Shikui Wei and Xinxiao Wu and Dong Xu",

year = "2013",

month = oct,

day = "1",

doi = "10.1007/978-1-4614-3501-3_40",

language = "English",

isbn = "1461435005",

volume = "9781461435013",

pages = "483--493",

booktitle = "The Era of Interactive Media",

publisher = "Springer New York",

address = "United States",

}

TY - CHAP

T1 - Partitioned k-means clustering for fast construction of unbiased visual vocabulary

AU - Wei, Shikui

AU - Wu, Xinxiao

AU - Xu, Dong

PY - 2013/10/1

Y1 - 2013/10/1

N2 - Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search area, in which a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform a flat or hierarchical clustering operation using a very large training set in their original description space. However, these methods usually suffer from two issues: (1) A large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) The generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Sequentially, we can build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.

AB - Bag-of-Words (BoW) model has been widely used for feature representation in multimedia search area, in which a key step is to vector-quantize local image descriptors and generate a visual vocabulary. Popular visual vocabulary construction schemes generally perform a flat or hierarchical clustering operation using a very large training set in their original description space. However, these methods usually suffer from two issues: (1) A large training set is required to construct a large visual vocabulary, making the construction computationally inefficient; (2) The generated visual vocabularies are heavily biased towards the training samples. In this work, we introduce a partitioned k-means clustering (PKM) scheme to efficiently generate a large and unbiased vocabulary using only a small training set. Instead of directly clustering training descriptors in their original space, we first split the original space into a set of subspaces and then perform a separate k-means clustering process in each subspace. Sequentially, we can build a complete visual vocabulary by combining different cluster centroids from multiple subspaces. Comprehensive experiments demonstrate that the proposed method indeed generates unbiased vocabularies and provides good scalability for building large vocabularies.

KW - BoW

KW - Image retrieval

KW - Partitioned K-means clustering

UR - http://www.scopus.com/inward/record.url?scp=84890032488&partnerID=8YFLogxK

U2 - 10.1007/978-1-4614-3501-3_40

DO - 10.1007/978-1-4614-3501-3_40

M3 - Chapter

AN - SCOPUS:84890032488

SN - 1461435005

SN - 9781461435006

VL - 9781461435013

SP - 483

EP - 493

BT - The Era of Interactive Media

PB - Springer New York

ER -
