An algorithm for clustering uncertain data streams over sliding windows

Guoyan Huang*, Dapeng Liang, Jiadong Ren, Changzhen Hu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

The existing algorithms for clustering data streams with uncertainty can not analyze recent data in detail. In this paper, we propose SWCUStreams (Clustering Uncertain Data Streams over Sliding Windows) to cluster uncertain data streams, which can obtain the distribution character of recent data by maintaining the Exponential Histogram of Uncertainty Cluster Feature (EHUCF). SWCUStreams adopts the clustering framework of CluStream. In the online micro-cluster phase, Uncertainty Temporal Cluster Feature (UTCF) is defined to describe the uncertainty tuples. Based on the Uncertainty Temporal Cluster Feature (UTCF), Exponential Histogram of Uncertainty Cluster Feature is proposed to store the distribution character of recent data as well as used to dynamically delete expired records included in EHUCF by associating with UTCF. In the offline macro-cluster phase, the final clustering results will be generated according to the statistic information of Exponential Histogram of Uncertainty Cluster Feature (EHUCF) by UK-means algorithm. The experimental results over different types of data sets show that the cluster quality of SWCUStreams is higher.

源语言英语
主期刊名Proceeding - 6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010
173-177
页数5
出版状态已出版 - 2010
活动6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010 - Seoul, 韩国
期限: 16 8月 201018 8月 2010

出版系列

姓名Proceeding - 6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010

会议

会议6th International Conference on Digital Content, Multimedia Technology and Its Applications, IDC2010
国家/地区韩国
Seoul
时期16/08/1018/08/10

指纹

探究 'An algorithm for clustering uncertain data streams over sliding windows' 的科研主题。它们共同构成独一无二的指纹。

引用此