Searching Density-Increasing Path to Local Density Peaks for Unsupervised Anomaly Detection

Jiachen Zhao, Fang Deng*, Jiaqi Zhu, Jie Chen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Unsupervised anomaly detection (AD) is a challenging problem in the data mining community. Clustering-based AD methods aim to group normal data points into clusters and then regard a point belonging to none of the clusters as an anomaly. However, they may suffer from the problems of unknown cluster numbers and arbitrary cluster shapes. This paper presents a novel clustering-based AD method named Density-increasing Path (DIP) to tackle these challenges. DIP searches a path for each data point. The path starts at the data point itself, passes through several points with monotonically increasing densities, and ends at a density peak. Further, DIP defines the climbing difficulty of each path by combining the distance and density increment of each step along the path, which can be regarded as the anomaly score of the path starting point. DIP can adaptively decide the number of peaks to address the challenge of unknown cluster numbers. Since DIP requires the path to pass several points rather than directly reaching the peak, it handles arbitrary cluster shapes. We also propose the ensemble DIP to improve prediction accuracy. The experimental results on four synthetic datasets and eleven real-world benchmarks demonstrate that DIP outperforms existing methods.

Original languageEnglish
Pages (from-to)1198-1209
Number of pages12
JournalIEEE Transactions on Big Data
Volume9
Issue number4
DOIs
Publication statusPublished - 1 Aug 2023

Keywords

  • Anomaly detection
  • clustering analysis
  • imbalance learning
  • outlier detection

Fingerprint

Dive into the research topics of 'Searching Density-Increasing Path to Local Density Peaks for Unsupervised Anomaly Detection'. Together they form a unique fingerprint.

Cite this