Efficient outlier detection algorithm for heterogeneous data streams

Jiadong Ren*, Qunhui Wu, Jia Zhang, Changzhen Hu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

10 Citations (Scopus)

Abstract

Data streams outlier mining is an important and active research issue in anomaly detection. Most of the existing outlier detection algorithms can only manipulate numeric attributes or categorical attributes. In this paper, we propose an efficient outlier detection algorithm based on heterogeneous data streams, which partitions the stream in chunks. Then each chunk is clustered and the corresponding clustering results are stored in cluster references. The representation degree and the number of adjacent cluster references of each cluster reference are computed to generate the final outlier references, which include potential outliers. Experimental results show that our approach has higher detection precision and better scalability.

Original languageEnglish
Title of host publication6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009
Pages259-264
Number of pages6
DOIs
Publication statusPublished - 2009
Event6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009 - Tianjin, China
Duration: 14 Aug 200916 Aug 2009

Publication series

Name6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009
Volume5

Conference

Conference6th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2009
Country/TerritoryChina
CityTianjin
Period14/08/0916/08/09

Keywords

  • Cluster reference
  • Data streams
  • Heterogeneous attributes
  • Outlier detection

Fingerprint

Dive into the research topics of 'Efficient outlier detection algorithm for heterogeneous data streams'. Together they form a unique fingerprint.

Cite this