A weighted subspace clustering algorithm in high-dimensional data streams

Jiadong Ren*, Lining Li, Changzhen Hu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Clustering is a significant and difficult problem in data stream mining due to a mass of streaming data arriving continuously. High-dimensional data streams make clustering analysis more complex because of the sparsity of data. In this paper, we propose a new clustering method for high-dimensional data streams, called WSCStream. The method incorporates a fading cluster structure and a dimensional weight matrix. We assign a weight to each dimension of corresponding cluster in the matrix. The weight associated with each dimension indicates the importance of each dimension to the corresponding cluster. The weighted distance between a cluster and a data point is used to obtain the final clusters as the new data points arrive over time. Experimental results on real and synthetic datasets demonstrate that WSCStream has higher clustering quality than PHStream.

Original languageEnglish
Title of host publication2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009
Pages631-634
Number of pages4
DOIs
Publication statusPublished - 2009
Event2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009 - Kaohsiung, Taiwan, Province of China
Duration: 7 Dec 20099 Dec 2009

Publication series

Name2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009

Conference

Conference2009 4th International Conference on Innovative Computing, Information and Control, ICICIC 2009
Country/TerritoryTaiwan, Province of China
CityKaohsiung
Period7/12/099/12/09

Keywords

  • Data stream
  • High dimension
  • Subspace clustering

Fingerprint

Dive into the research topics of 'A weighted subspace clustering algorithm in high-dimensional data streams'. Together they form a unique fingerprint.

Cite this