跳到主要导航 跳到搜索 跳到主要内容

Cost-Aware Triangle Counting Over Geo-Distributed Datacenters

  • Delong Ma*
  • , Ye Yuan
  • , Yanfeng Zhang
  • , Chunze Cao
  • , Yuliang Ma
  • *此作品的通讯作者
  • Northeastern University China

科研成果: 期刊稿件文章同行评审

摘要

Counting triangles is an important topic in many practical applications, such as anomaly detection, community search, and recommendation systems. For triangle counting in large and dynamic graphs, recent work has focused on distributed streaming algorithms. These works assume that the graph is processed in the same location, while in reality, the graph stream may be generated and processed in datacenters that are geographically distributed. This raises new challenges to existing triangle counting algorithms, due to the multi-level heterogeneities in network bandwidth and communication prices in geo-distributed datacenters. In this article, we propose a cost-aware framework named GeoTri based on the Master-Worker-Aggregator architecture, which takes both the cost and performance objectives into consideration for triangle counting in geo-distributed datacenters. The two core parts of this framework are the cost-aware nodes assignment strategy in master, which is critical to obtain node’s position and distribute edges reasonably to reduce the cost (i.e., time cost and monetary cost), and cost-aware neighbor transfer strategy among workers, which further eliminates redundancy in data transfers.

源语言英语
页(从-至)2008-2024
页数17
期刊IEEE Transactions on Big Data
11
4
DOI
出版状态已出版 - 2025

指纹

探究 'Cost-Aware Triangle Counting Over Geo-Distributed Datacenters' 的科研主题。它们共同构成独一无二的指纹。

引用此