MapReduce-Based Graph Structural Clustering Algorithm

Wei Peng Zhang, Zhen Jun Li*, Rong Hua Li, Yu Hong Liu, Rui Mao, Shao Jie Qiao

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

Graph Clustering is a fundamental task for graph mining which has been widely used in social network analysis related applications. Graph structural clustering (SCAN) is a well-known density-based graph clustering algorithm. SCAN algorithm can not only find the clusters in a graph, but also be able to identify hub nodes and outliers. However, with the growing graph size, the traditional SCAN algorithm is very hard to handle massive graph data, as its time complexity is O(m1.5) (m is the number of edges in the graph). To overcome the scalability issue of SCAN algorithm, this paper proposes a MapReduce based graph structural clustering algorithm, called MRSCAN. Specifically, the paper develops a MapReduce based similarity computation, a core node computation, as well as two clustering merging algorithms. In addition, it conducts extensive experiments over serval real-world graph datasets, and results demonstrate the accuracy, effectiveness, and scalability of the presented algorithm.

Original languageEnglish
Pages (from-to)627-641
Number of pages15
JournalRuan Jian Xue Bao/Journal of Software
Volume29
Issue number3
DOIs
Publication statusPublished - 2018
Externally publishedYes

Keywords

  • Graph data
  • MapReduce
  • Parallel computing model
  • Structural graph clustering

Fingerprint

Dive into the research topics of 'MapReduce-Based Graph Structural Clustering Algorithm'. Together they form a unique fingerprint.

Cite this