TY - JOUR
T1 - MapReduce-Based Graph Structural Clustering Algorithm
AU - Zhang, Wei Peng
AU - Li, Zhen Jun
AU - Li, Rong Hua
AU - Liu, Yu Hong
AU - Mao, Rui
AU - Qiao, Shao Jie
N1 - Publisher Copyright:
© Copyright 2018, Institute of Software, the Chinese Academy of Sciences. All rights reserved.
PY - 2018
Y1 - 2018
N2 - Graph Clustering is a fundamental task for graph mining which has been widely used in social network analysis related applications. Graph structural clustering (SCAN) is a well-known density-based graph clustering algorithm. SCAN algorithm can not only find the clusters in a graph, but also be able to identify hub nodes and outliers. However, with the growing graph size, the traditional SCAN algorithm is very hard to handle massive graph data, as its time complexity is O(m1.5) (m is the number of edges in the graph). To overcome the scalability issue of SCAN algorithm, this paper proposes a MapReduce based graph structural clustering algorithm, called MRSCAN. Specifically, the paper develops a MapReduce based similarity computation, a core node computation, as well as two clustering merging algorithms. In addition, it conducts extensive experiments over serval real-world graph datasets, and results demonstrate the accuracy, effectiveness, and scalability of the presented algorithm.
AB - Graph Clustering is a fundamental task for graph mining which has been widely used in social network analysis related applications. Graph structural clustering (SCAN) is a well-known density-based graph clustering algorithm. SCAN algorithm can not only find the clusters in a graph, but also be able to identify hub nodes and outliers. However, with the growing graph size, the traditional SCAN algorithm is very hard to handle massive graph data, as its time complexity is O(m1.5) (m is the number of edges in the graph). To overcome the scalability issue of SCAN algorithm, this paper proposes a MapReduce based graph structural clustering algorithm, called MRSCAN. Specifically, the paper develops a MapReduce based similarity computation, a core node computation, as well as two clustering merging algorithms. In addition, it conducts extensive experiments over serval real-world graph datasets, and results demonstrate the accuracy, effectiveness, and scalability of the presented algorithm.
KW - Graph data
KW - MapReduce
KW - Parallel computing model
KW - Structural graph clustering
UR - https://www.scopus.com/pages/publications/85049528841
U2 - 10.13328/j.cnki.jos.005456
DO - 10.13328/j.cnki.jos.005456
M3 - Article
AN - SCOPUS:85049528841
SN - 1000-9825
VL - 29
SP - 627
EP - 641
JO - Ruan Jian Xue Bao/Journal of Software
JF - Ruan Jian Xue Bao/Journal of Software
IS - 3
ER -