An algorithm for clustering uncertain data streams over sliding windows

Guoyan Huang*, Dapeng Liang, Jiadong Ren, Changzhen Hu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

5 Citations (Scopus)

Abstract

Existing algorithms for clustering uncertain data streams cannot analyze recent data in detail. In this paper, we propose SWCUStreams (Clustering Uncertain Data Streams over Sliding Windows), which captures the distribution characteristics of recent data by maintaining an Exponential Histogram of Uncertainty Cluster Feature (EHUCF). SWCUStreams adopts the clustering framework of CluStream. In the online micro-cluster phase, the Uncertainty Temporal Cluster Feature (UTCF) is defined to describe uncertain tuples. Building on UTCF, the EHUCF is proposed to store the distribution characteristics of recent data and, through its association with UTCF, to dynamically delete expired records. In the offline macro-cluster phase, the UK-means algorithm generates the final clustering results from the statistical information held in the EHUCF. Experimental results on different types of data sets show that SWCUStreams achieves higher cluster quality.
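The abstract does not give the definitions of UTCF or EHUCF, so the following is only a minimal sketch of the general idea of an exponential histogram of additive cluster features over a sliding window. Everything named here is an assumption made for illustration: the `Bucket` fields (one-dimensional expected values plus a summed error variance), the `ExpHistogram` class, and the `max_per_size` merge threshold are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Bucket:
    """Hypothetical temporal cluster feature for 1-D uncertain tuples."""
    n: int      # number of tuples summarized
    ls: float   # linear sum of expected values
    ss: float   # sum of squared expected values
    err: float  # sum of per-tuple error variances (assumed uncertainty model)
    t: int      # timestamp of the most recent tuple in the bucket

    def merge(self, newer: "Bucket") -> "Bucket":
        # Cluster features are additive; the merged bucket keeps the newer timestamp.
        return Bucket(self.n + newer.n, self.ls + newer.ls,
                      self.ss + newer.ss, self.err + newer.err,
                      max(self.t, newer.t))


class ExpHistogram:
    """Sliding-window summary: bucket sizes are powers of two, oldest first."""

    def __init__(self, window: int, max_per_size: int = 2):
        self.window = window              # window length in time units
        self.max_per_size = max_per_size  # assumed cap on buckets per size
        self.buckets: List[Bucket] = []   # ordered oldest -> newest

    def insert(self, value: float, variance: float, t: int) -> None:
        self.buckets.append(Bucket(1, value, value * value, variance, t))
        self._expire(t)
        self._compact()

    def _expire(self, now: int) -> None:
        # Drop buckets whose most recent tuple has left the window.
        self.buckets = [b for b in self.buckets if b.t > now - self.window]

    def _compact(self) -> None:
        size = 1
        while True:
            idx = [i for i, b in enumerate(self.buckets) if b.n == size]
            if len(idx) <= self.max_per_size:
                break
            i, j = idx[0], idx[1]  # the two oldest buckets of this size
            self.buckets[i:j + 1] = [self.buckets[i].merge(self.buckets[j])]
            size *= 2              # the merge may overflow the next size

    def summary(self) -> Bucket:
        # Aggregate feature of everything still inside the window.
        total = Bucket(0, 0.0, 0.0, 0.0, 0)
        for b in self.buckets:
            total = total.merge(b)
        return total


if __name__ == "__main__":
    eh = ExpHistogram(window=4)
    for t in range(1, 9):
        eh.insert(value=float(t), variance=0.1, t=t)
    s = eh.summary()
    print([b.n for b in eh.buckets], s.n, s.ls / s.n)
```

Because each bucket is dropped only when its newest tuple expires, the summary can briefly include a few tuples that are slightly older than the window; that bounded imprecision is the usual price of keeping only a logarithmic number of buckets.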

Original language: English
Title of host publication: Proceeding - 6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010
Pages: 173-177
Number of pages: 5
Publication status: Published - 2010
Event: 6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010 - Seoul, Korea, Republic of
Duration: 16 Aug 2010 - 18 Aug 2010

Publication series

Name: Proceeding - 6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010

Conference

Conference: 6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010
Country/Territory: Korea, Republic of
City: Seoul
Period: 16/08/10 - 18/08/10

Keywords

  • Clustering
  • Sliding windows
  • Uncertain data streams
