跳到主要导航 跳到搜索 跳到主要内容

Data field for hierarchical clustering

  • Shuliang Wang*
  • , Gan Wenyan Gan
  • , Deyi Li
  • , Deren Li
  • *此作品的通讯作者
  • University of Pittsburgh
  • Wuhan University
  • Nanjing University of Science and Technology
  • Tsinghua University

科研成果: 期刊稿件文章同行评审

摘要

In this paper, data field is proposed to group data objects via simulating their mutual interactions and opposite movements for hierarchical clustering. Enlightened by the field in physical space, data field to simulate nuclear field is presented to illuminate the interaction between objects in data space. In the data field, the self-organized process of equipotential lines on many data objects discovers their hierarchical clustering-characteristics. During the clustering process, a random sample is first generated to optimize the impact factor. The masses of data objects are then estimated to select core data object with nonzero masses. Taking the core data objects as the initial clusters, the clusters are iteratively merged hierarchy by hierarchy with good performance. The results of a case study show that the data field is capable of hierarchical clustering on objects varying size, shape or granularity without user-specified parameters, as well as considering the object features inside the clusters and removing the outliers from noisy data. The comparisons illustrate that the data field clustering performs better than K-means, BIRCH, CURE, and CHAMELEON.

源语言英语
页(从-至)43-63
页数21
期刊International Journal of Data Warehousing and Mining
7
4
DOI
出版状态已出版 - 10月 2011
已对外发布

指纹

探究 'Data field for hierarchical clustering' 的科研主题。它们共同构成独一无二的指纹。

引用此