HCluWin: An algorithm for clustering heterogeneous data streams over sliding windows

Jiadong Ren*, Changzhen Hu, Ruiqing Ma

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

9 引用 (Scopus)

摘要

Many applications in web usage mining, such as business intelligence and usage characterization, require effective and efficient techniques to discover the users with similar usage patterns and, the web pages with correlate contents in the physical world. Clustering click streams can help to achieve the goal. Despite the high processing rate, the existing methods for clustering click streams over sliding widows suffer from the missing of categorical attributes in click stream, data. In this paper, we present HCluWin, an approach for clustering heterogeneous data, streams which contain both continuous attributes and, categorical attributes over sliding windows. A Heterogeneous Temporal Cluster Feature (HTCF) is introduced, to m,onitor the distribution statistics of heterogeneous data, points. Based, on this structure, Exponential Histogram, of Heterogeneous Cluster Feature (EHHCF) is presented. Simultaneously, a, new similarity m,ea,sure between two heterogeneous objects is proposed. Experimental results show that the clustering quality of HCluWin is higher than CluWin and, the stream, processing rate of HCluWin is higher than HCluStream,.

源语言英语
页(从-至)2171-2179
页数9
期刊International Journal of Innovative Computing, Information and Control
6
5
出版状态已出版 - 5月 2010

指纹

探究 'HCluWin: An algorithm for clustering heterogeneous data streams over sliding windows' 的科研主题。它们共同构成独一无二的指纹。

引用此