TY - JOUR
T1 - Efficient subgraph search over large uncertain graphs
AU - Yuan, Ye
AU - Wang, Guoren
AU - Wang, Haixun
AU - Chen, Lei
PY - 2011/8
Y1 - 2011/8
N2 - Retrieving graphs containing a query graph from a large graph database is a key task in many graph-based applications, includ-ing chemical compounds discovery, protein complex prediction, and structural pattern recognition. However, graph data handled by these applications is often noisy, incomplete, and inaccurate be-cause of the way the data is produced. In this paper, we study sub-graph queries over uncertain graphs. Specifically, we consider the problem of answering threshold-based probabilistic queries over a large uncertain graph database with the possible world seman-tics. We prove that problem is #P-complete, therefore, we adopt a filtering-and-verification strategy to speed up the search. In the filtering phase, we use a probabilistic inverted index, PIndex, based on subgraph features obtained by an optimal feature selection pro-cess. During the verification phase, we develop exact and bound algorithms to validate the remaining candidates. Extensive experi-mental results demonstrate the effectiveness of the proposed algo-rithms.
AB - Retrieving graphs containing a query graph from a large graph database is a key task in many graph-based applications, includ-ing chemical compounds discovery, protein complex prediction, and structural pattern recognition. However, graph data handled by these applications is often noisy, incomplete, and inaccurate be-cause of the way the data is produced. In this paper, we study sub-graph queries over uncertain graphs. Specifically, we consider the problem of answering threshold-based probabilistic queries over a large uncertain graph database with the possible world seman-tics. We prove that problem is #P-complete, therefore, we adopt a filtering-and-verification strategy to speed up the search. In the filtering phase, we use a probabilistic inverted index, PIndex, based on subgraph features obtained by an optimal feature selection pro-cess. During the verification phase, we develop exact and bound algorithms to validate the remaining candidates. Extensive experi-mental results demonstrate the effectiveness of the proposed algo-rithms.
UR - http://www.scopus.com/inward/record.url?scp=84863753859&partnerID=8YFLogxK
U2 - 10.14778/3402707.3402726
DO - 10.14778/3402707.3402726
M3 - Conference article
AN - SCOPUS:84863753859
SN - 2150-8097
VL - 4
SP - 876
EP - 886
JO - Proceedings of the VLDB Endowment
JF - Proceedings of the VLDB Endowment
IS - 11
T2 - 37th International Conference on Very Large Data Bases, VLDB 2011
Y2 - 29 August 2011 through 3 September 2011
ER -