Exact Single-Source SimRank Computation on Large Graphs

Hanzhi Wang, Zhewei Wei, Ye Yuan, Xiaoyong Du, Ji Rong Wen

科研成果: 书/报告/会议事项章节会议稿件同行评审

19 引用 (Scopus)

摘要

SimRank is a popular measurement for evaluating the node-to-node similarities based on the graph topology. In recent years, single-source and top-k SimRank queries have received increasing attention due to their applications in web mining, social network analysis, and spam detection. However, a fundamental obstacle in studying SimRank has been the lack of ground truths. The only exact algorithm, Power Method, is computationally infeasible on graphs with more than 106 nodes. Consequently, no existing work has evaluated the actual trade-offs between query time and accuracy on large real-world graphs. In this paper, we present ExSim, the first algorithm that computes the exact single-source and top-k SimRank results on large graphs. With high probability, this algorithm produces ground truths with a rigorous theoretical guarantee. We conduct extensive experiments on real-world datasets to demonstrate the efficiency of ExactSim. The results show that ExactSim provides the ground truth for any single-source SimRank query with a precision up to 7 decimal places within a reasonable query time.

源语言英语
主期刊名SIGMOD 2020 - Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data
出版商Association for Computing Machinery
653-663
页数11
ISBN(电子版)9781450367356
DOI
出版状态已出版 - 14 6月 2020
活动2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020 - Portland, 美国
期限: 14 6月 202019 6月 2020

出版系列

姓名Proceedings of the ACM SIGMOD International Conference on Management of Data
ISSN(印刷版)0730-8078

会议

会议2020 ACM SIGMOD International Conference on Management of Data, SIGMOD 2020
国家/地区美国
Portland
时期14/06/2019/06/20

指纹

探究 'Exact Single-Source SimRank Computation on Large Graphs' 的科研主题。它们共同构成独一无二的指纹。

引用此