Efficient subgraph search over large uncertain graphs

Ye Yuan*, Guoren Wang, Haixun Wang, Lei Chen

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

83 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 83
  • Captures
    • Readers: 71
see details

Abstract

Retrieving graphs containing a query graph from a large graph database is a key task in many graph-based applications, includ-ing chemical compounds discovery, protein complex prediction, and structural pattern recognition. However, graph data handled by these applications is often noisy, incomplete, and inaccurate be-cause of the way the data is produced. In this paper, we study sub-graph queries over uncertain graphs. Specifically, we consider the problem of answering threshold-based probabilistic queries over a large uncertain graph database with the possible world seman-tics. We prove that problem is #P-complete, therefore, we adopt a filtering-and-verification strategy to speed up the search. In the filtering phase, we use a probabilistic inverted index, PIndex, based on subgraph features obtained by an optimal feature selection pro-cess. During the verification phase, we develop exact and bound algorithms to validate the remaining candidates. Extensive experi-mental results demonstrate the effectiveness of the proposed algo-rithms.

Original languageEnglish
Pages (from-to)876-886
Number of pages11
JournalProceedings of the VLDB Endowment
Volume4
Issue number11
DOIs
Publication statusPublished - Aug 2011
Externally publishedYes
Event37th International Conference on Very Large Data Bases, VLDB 2011 - Seattle, United States
Duration: 29 Aug 20113 Sept 2011

Fingerprint

Dive into the research topics of 'Efficient subgraph search over large uncertain graphs'. Together they form a unique fingerprint.

Cite this

Yuan, Y., Wang, G., Wang, H., & Chen, L. (2011). Efficient subgraph search over large uncertain graphs. Proceedings of the VLDB Endowment, 4(11), 876-886. https://doi.org/10.14778/3402707.3402726