I/O cost minimization: Reachability queries processing over massive graphs

Zhiwei Zhang*, Jeffrey Xu Yu, Lu Qin, Qing Zhu, Xiaofang Zhou

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

15 引用 (Scopus)

摘要

Given a directed graph G, a reachability query (u, v) asks whether there exists a path from a node u to a node v in G. The existing studies support reachability queries using indexing techniques, where both the graph and the index are required to reside in main memory. However, they cannot handle reachability queries on massive graphs, when the graph and the index cannot be entirely held in memory because of the high I/O cost. In this paper, we focus on how to minimize the I/O cost when answering reachability queries on massive graphs that cannot reside entirely in memory. First, we propose a new Yes-Label scheme, as a complement of the No-Label used in GRAIL [23], to reduce the number of intermediate results generated. Second, we show how to minimize the number of I/Os using a heap-on-disk data structure when traversing a graph. We also propose new methods to partition the heap-on-disk, in order to ensure that only sequential I/Os are performed. Third, we analyze our approaches and show how to extend our approaches to answer multiple reachability queries effectively. Finally, we conducted extensive performance studies on both large synthetic and large real graphs, and confirm the efficiency of our approaches.

源语言英语
主期刊名Advances in Database Technology - EDBT 2012
主期刊副标题15th International Conference on Extending Database Technology, Proceedings
468-479
页数12
DOI
出版状态已出版 - 2012
已对外发布
活动15th International Conference on Extending Database Technology, EDBT 2012 - Berlin, 德国
期限: 27 3月 201230 3月 2012

出版系列

姓名ACM International Conference Proceeding Series

会议

会议15th International Conference on Extending Database Technology, EDBT 2012
国家/地区德国
Berlin
时期27/03/1230/03/12

指纹

探究 'I/O cost minimization: Reachability queries processing over massive graphs' 的科研主题。它们共同构成独一无二的指纹。

引用此