Grid Probability Density-based clustering for uncertain data streams over sliding windows

  • Guoyang Huang*
  • , Dapeng Liang
  • , Changzhen Hu
  • , Jiadong Ren
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

5 Citations (Scopus)

Abstract

The existing algorithms for clustering uncertain data streams are unable to obtain clusters of arbitrary shapes. In order to address this issue, this paper proposes GD-CUStreams, which adopts two phased-based clustering frameworks. In the online phase, the store space is divided into grid and Uncertainty Grid Clustering Feature (UGCF) is defined to acquire the uncertainty information of tuple and the summary information is stored in UGCF. In the offline phase, according to the Grid Probability Density threshold, GD-CUStreams detects all grids. Furthermore, the type of grid is determined and sporadic grids will be detected from all sparse grids based on density threshold function. While the clustering request arrives, GD-CUStreams outputs all grids with the type of normal and transition. Finally, clusters of arbitrary shapes are generated. Experimental results show that GD-CUStreams has higher clustering quality. ICIC International

Original languageEnglish
Pages (from-to)1359-1364
Number of pages6
JournalICIC Express Letters
Volume5
Issue number4 B
Publication statusPublished - Apr 2011

Keywords

  • Clustering
  • Grid Probability Density
  • Sliding windows
  • Uncertain data streams

Fingerprint

Dive into the research topics of 'Grid Probability Density-based clustering for uncertain data streams over sliding windows'. Together they form a unique fingerprint.

Cite this