DistR: A Distributed Method for the Reachability Query over Large Uncertain Graphs

Yurong Cheng, Ye Yuan, Lei Chen, Guoren Wang, Christophe Giraud-Carrier, Yongjiao Sun

Research output: Contribution to journalArticlepeer-review

26 Citations (Scopus)

Abstract

Among uncertain graph queries, reachability, i.e., the probability that one vertex is reachable from another, is likely the most fundamental one. Although this problem has been studied within the field of network reliability, solutions are implemented on a single computer and can only handle small graphs. However, as the size of graph applications continually increases, the corresponding graph data can no longer fit within a single computer's memory and must therefore be distributed across several machines. Furthermore, the computation of probabilistic reachability queries is #P-complete making it very expensive even on small graphs. In this paper, we develop an efficient distributed strategy, called DistR, to solve the problem of reachability query over large uncertain graphs. Specifically, we perform the task in two steps: distributed graph reduction and distributed consolidation. In the distributed graph reduction step, we find all of the maximal subgraphs of the original graph, whose reachability probabilities can be calculated in polynomial time, compute them and reduce the graph accordingly. After this step, only a small graph remains. In the distributed consolidation step, we transform the problem into a relational join process and provide an approximate answer to the #P-complete reachability query. Extensive experimental studies show that our distributed approach is efficient in terms of both computational and communication costs, and has high accuracy.

Original languageEnglish
Article number7420710
Pages (from-to)3172-3185
Number of pages14
JournalIEEE Transactions on Parallel and Distributed Systems
Volume27
Issue number11
DOIs
Publication statusPublished - 1 Nov 2016
Externally publishedYes

Keywords

  • Reachability
  • distributed computing
  • uncertain graphs

Fingerprint

Dive into the research topics of 'DistR: A Distributed Method for the Reachability Query over Large Uncertain Graphs'. Together they form a unique fingerprint.

Cite this