Cost-optimized microblog distribution over geo-distributed data centers: Insights from cross-media analysis

Han Hu, Yonggang Wen, Tat Seng Chua, Xuelong Li

Research output: Contribution to journalArticlepeer-review

6 Citations (Scopus)

Abstract

The unprecedent growth of microblog services poses significant challenges on network traffic and service latency to the underlay infrastructure (i.e., geo-distributed data centers). Furthermore, the dynamic evolution in microblog status generates a huge workload on data consistence maintenance. In this article, motivated by insights of cross-media analysis-based propagation patterns, we propose a novel cache strategy for microblog service systems to reduce the inter-data center traffic and consistence maintenance cost, while achieving low service latency. Specifically, we first present a microblog classification method, which utilizes the external knowledge from correlated domains, to categorize microblogs. Then we conduct a large-scale measurement on a representative online social network system to study the category-based propagation diversity on region and time scales. These insights illustrate social common habits on creating and consuming microblogs and further motivate our architecture design. Finally, we formulate the content cache problem as a constrained optimization problem. By jointly using the Lyapunov optimization framework and simplex gradient method, we find the optimal online control strategy. Extensive trace-driven experiments further demonstrate that our algorithm reduces the system cost by 24.5% against traditional approaches with the same service latency.

Original languageEnglish
Article number40
JournalACM Transactions on Intelligent Systems and Technology
Volume8
Issue number3
DOIs
Publication statusPublished - Apr 2017
Externally publishedYes

Keywords

  • Cross-media analysis
  • Data center
  • Performance optimization
  • Social media analytics

Fingerprint

Dive into the research topics of 'Cost-optimized microblog distribution over geo-distributed data centers: Insights from cross-media analysis'. Together they form a unique fingerprint.

Cite this