Efficient Graph Query Processing over Geo-Distributed Datacenters

Ye Yuan, Delong Ma, Zhenyu Wen, Yuliang Ma, Guoren Wang, Lei Chen

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

10 Citations (Scopus)

Abstract

Graph queries have emerged as one of the fundamental techniques to support modern search services, such as PageRank web search, social networking search and knowledge graph search. Because such graphs are maintained globally and are very large (e.g., billions of nodes), we need to efficiently process graph queries across multiple geographically distributed datacenters, that is, to run geo-distributed graph queries. Existing graph computing frameworks may not work well for geographically distributed datacenters, because they implement a Bulk Synchronous Parallel model that requires excessive inter-datacenter transfers, thereby introducing extremely large latency for query processing. In this paper, we propose GeoGraph, a universal framework to support efficient geo-distributed graph query processing based on clustering datacenters and a meta-graph, while reducing inter-datacenter communication. Our new framework can be applied to many types of graph algorithms without any modification. The framework is developed on top of Apache Giraph. The experiments were conducted by applying four important graph queries, i.e., shortest path, graph keyword search, subgraph isomorphism and PageRank. The evaluation results show that our proposed framework can achieve up to 82% faster convergence, 42% lower WAN bandwidth usage, and 45% less total monetary cost for the four graph queries, with input graphs stored across ten geo-distributed datacenters.

Original language: English
Title of host publication: SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval
Publisher: Association for Computing Machinery, Inc
Pages: 619-628
Number of pages: 10
ISBN (Electronic): 9781450380164
DOI
Publication status: Published - 25 Jul 2020
Event: 43rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020 - Virtual, Online, China
Duration: 25 Jul 2020 to 30 Jul 2020

Publication series

Name: SIGIR 2020 - Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

Conference

Conference: 43rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020
Country/Territory: China
City: Virtual, Online
Period: 25/07/20 to 30/07/20
