Robust clustering with distance and density

Hanning Yuan*, Shuliang Wang, Jing Geng, Yang Yu, Ming Zhong

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

Clustering is fundamental for using big data. However, AP (affinity propagation) is not good at non-convex datasets, and the input parameter has a marked impact on DBSCAN (density-based spatial clustering of applications with noise). Moreover, new characteristics such as volume, variety, velocity, veracity make it difficult to group big data. To address the issues, a parameter free AP (PFAP) is proposed to group big data on the basis of both distance and density. Firstly, it obtains a group of normalized density from the AP clustering. The estimated parameters are monotonically. Then, the density is used for density clustering for multiple times. Finally, the multiple-density clustering results undergo a two-stage amalgamation to achieve the final clustering result. Experimental results on several benchmark datasets show that PFAP has been achieved better clustering quality than DBSCAN, AP, and APSCAN. And it also has better performance than APSCAN and FSDP.

源语言英语
页(从-至)63-74
页数12
期刊International Journal of Data Warehousing and Mining
13
2
DOI
出版状态已出版 - 1 4月 2017

指纹

探究 'Robust clustering with distance and density' 的科研主题。它们共同构成独一无二的指纹。

引用此