An improved outlier detection method in high-dimension based on weighted hypergraph

Yin Zhao Li*, Di Wu, Jia Dong Ren, Chang Zhen Hu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

Outlier detection in high-dimensional space is a hot topic in data mining, the main goal is to find out a small quantity of data objects with abnormal behavior in data set. In this paper, the concepts of the feature vector and the attribute similarity are defined, an improved algorithm SWHOT based on weighed hypergraph model for outlier detection in high dimensional space is presented. The objects in high dimensional space are translated into binary data type, by looking for the hyperedge of binary set, the data set hypergarph model is established, meanwhile, the weight of the hyperedge is equal to the value of the attribute similarity. In addition, the objects of the hypergraph are clustered by CURE algorithm, arbitrary shaped clusters can be identified. Furthermore, the outliers are found according to the point-to-window weighted support, the point-to-class belongingness and the point-to-window weighted deviation of size, the meaningful outliers in high-dimension can be mined by means of appropriate user-defined threshold. Experimental results show that SWHOT can improve scaling and precision.

Original languageEnglish
Title of host publication2nd International Symposium on Electronic Commerce and Security, ISECS 2009
Pages159-163
Number of pages5
DOIs
Publication statusPublished - 2009
Event2nd International Symposium on Electronic Commerce and Security, ISECS 2009 - Nanchang, China
Duration: 22 May 200924 May 2009

Publication series

Name2nd International Symposium on Electronic Commerce and Security, ISECS 2009
Volume2

Conference

Conference2nd International Symposium on Electronic Commerce and Security, ISECS 2009
Country/TerritoryChina
CityNanchang
Period22/05/0924/05/09

Keywords

  • Clustering
  • Hypergraph
  • Outlier detection
  • Similarity
  • Weight

Fingerprint

Dive into the research topics of 'An improved outlier detection method in high-dimension based on weighted hypergraph'. Together they form a unique fingerprint.

Cite this