Optimal bandwidth selection for density-based clustering

Hong Jin, Shuliang Wang*, Qian Zhou, Ying Li

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Cluster analysis has long played an important role in a wide variety of data applications. When the clusters are irregular or intertwined, density-based clustering is proved to be much more efficient. The quality of clustering result depends on an adequate choice of the parameters. However, withouenough domain knowledge the parameter setting is somewhat limited in its operability. In this paper, a new method is proposed to automatically find outhe optimal parameter value of the bandwidth. It is to infer the most suitable parameter value by the constructed model on parameter estimation. Based on the Bayesian Theorem, from which the most probability value for the bandwidth can be acquired in accordance with the inherent distribution characteristics of the original data set. Clusters can then be identified by the determined parameter values. The results of the experiment show that the proposed method has complementary advantages in the density-based clustering algorithm.

源语言英语
主期刊名Database Systems for Adanced Applications - 16th International Conference, DASFAA 2011, International Workshops
主期刊副标题GDB, SIM3, FlashDB, SNSMW, DaMEN, DQIS, Proceedings
编辑Jianliang Xu, Ge Yu, Shuigeng Zhou, Rainer Unland
出版商Springer Verlag
156-167
页数12
ISBN(印刷版)9783642202438
DOI
出版状态已出版 - 2011
已对外发布
活动16th International Conference on Database Systems for Advanced Applications, DASFAA 2011 - Hong Kong, 中国
期限: 22 4月 201125 4月 2011

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
6637 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议16th International Conference on Database Systems for Advanced Applications, DASFAA 2011
国家/地区中国
Hong Kong
时期22/04/1125/04/11

指纹

探究 'Optimal bandwidth selection for density-based clustering' 的科研主题。它们共同构成独一无二的指纹。

引用此