Extreme clustering – A clustering method via density extreme points

Shuliang Wang, Qi Li*, Chuanfeng Zhao, Xingquan Zhu, Hanning Yuan, Tianru Dai

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

60 Citations (Scopus)

Abstract

Peak clustering, a density based clustering method, has shown remarkable performance in clustering analysis of data. In reality, peak clustering suffers from two major drawbacks: (i) when the difference in cluster sample density is significant, it becomes difficult for peak clustering to find cluster centres in low density clusters. (ii) in some cases, it will incorrectly detect many normal points as noises. In this paper, we propose a new extreme clustering method to overcome the drawbacks of peak clustering. The theme of extreme clustering is to identify density extreme points to find cluster centres. In addition, a noise detection module is also introduced to identify noisy data points from the clustering results. As a result, the extreme clustering is robust to datasets with different density distributions. Experiments and validations, on over 40 datasets, show that extreme clustering can not only inherit the cluster validity of peak clustering, but also overcome its shortages with significant performance gain. Case studies on real-world haze analysis also demonstrate the performance of extreme clustering method in finding some main haze origins in a Chinese city.

Original languageEnglish
Pages (from-to)24-39
Number of pages16
JournalInformation Sciences
Volume542
DOIs
Publication statusPublished - 4 Jan 2021

Keywords

  • Clustering
  • Density
  • Density peak clustering
  • Extreme point
  • Haze analysis

Fingerprint

Dive into the research topics of 'Extreme clustering – A clustering method via density extreme points'. Together they form a unique fingerprint.

Cite this