Efficiently Sampling and Estimating Hypergraphs By Hybrid Random Walk

Lingling Zhang*, Zhiwei Zhang, Guoren Wang, Ye Yuan

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Hypergraphs provide a powerful tool for representing group interactions in complicated networks. Analyzing statical properties of hypergraphs by sampling is an increasing fundamental research problem in the field of data processing. However, the state-of-the-art sampling methods either focus on pairwise graphs or are insensitive to the structures formed by vertices and hyperedges, resulting in estimations with low accuracy and efficiency. To efficiently characterize the properties of both vertices and hyperedges, this paper first proposes a hybrid random walk based Markov Chain Monte Carlo (MCMC) model theoretically by carefully designing its mixture states and the transition matrix. For simplifying the implementation of this model, we develop an algorithm formed by vertex and hyperedge transitions saving costs for constructing mixture states in practice along with an estimating method for accurate estimations. Furthermore, we employ a non-backtracking strategy in the vertex transitions to accelerate the convergence of the hybrid random walk and propose to skip the sampled vertices in the hyperedge transitions to avoid being trapped in the local subgraph for improving accuracy and reducing query cost. Extensive experimental results on the real-world datasets confirm the higher accuracy and efficiency of our proposed methods than the sophisticated sampling methods.

源语言英语
主期刊名Proceedings - 2023 IEEE 39th International Conference on Data Engineering, ICDE 2023
出版商IEEE Computer Society
1273-1285
页数13
ISBN(电子版)9798350322279
DOI
出版状态已出版 - 2023
活动39th IEEE International Conference on Data Engineering, ICDE 2023 - Anaheim, 美国
期限: 3 4月 20237 4月 2023

出版系列

姓名Proceedings - International Conference on Data Engineering
2023-April
ISSN(印刷版)1084-4627

会议

会议39th IEEE International Conference on Data Engineering, ICDE 2023
国家/地区美国
Anaheim
时期3/04/237/04/23

指纹

探究 'Efficiently Sampling and Estimating Hypergraphs By Hybrid Random Walk' 的科研主题。它们共同构成独一无二的指纹。

引用此