A K-means approach based on concept hierarchical tree for search results clustering

Peng Jiang*, Chunxia Zhang, Guisuo Guo, Zhendong Niu, Dongping Gao

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

Search results clustering aims to facilitate users' information retrieval process and query refinement by online grouping similar documents returned from the search engine. It has stringent requirements on performance and meaningful cluster labels. Thus, most existing clustering algorithms such as K-means and agglomerative hierarchical clustering cannot be directly applied to the task of online search results clustering. In this paper, we propose a K-means approach based on concept hierarchical tree to cluster search results. This algorithm not only overcomes weaknesses of the classic K-means method: the results produced depend on the initial seeds and the parameter k is often unknown, but also satisfies the requirements of online search results clustering. Our method utilizes the semantic relation among documents by mapping terms to concepts in the concept hierarchical tree, which can be constructed by WordNet. We have developed a meta-search and clustering system based on our approach, followed by using an impersonal and repeatable evaluation solution. Experimental results indicate that our proposed algorithm is effective and suitable in performing the task of clustering search results.

源语言英语
主期刊名6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009
380-386
页数7
DOI
出版状态已出版 - 2009
活动6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009 - Tianjin, 中国
期限: 14 8月 200916 8月 2009

出版系列

姓名6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009
1

会议

会议6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009
国家/地区中国
Tianjin
时期14/08/0916/08/09

指纹

探究 'A K-means approach based on concept hierarchical tree for search results clustering' 的科研主题。它们共同构成独一无二的指纹。

引用此