Random-walk domination in large graphs

Rong Hua Li; Jeffrey Xu Yu; Xin Huang; Hong Cheng

doi:10.1109/ICDE.2014.6816696

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

30 Citations (Scopus)

Abstract

We introduce and formulate two types of random-walk domination problems in graphs, motivated by a number of practical applications (e.g., the item-placement problem in online social networks, the Ads-placement problem in advertisement networks, and the resource-placement problem in P2P networks). Specifically, given a graph G, the goal of the first type of random-walk domination problem is to target k nodes such that the total hitting time of an L-length random walk starting from the remaining nodes to the targeted nodes is minimized. The second type of random-walk domination problem is to find k nodes that maximize the expected number of nodes that hit any one targeted node through an L-length random walk. We prove that both problems are special instances of submodular set function maximization under a cardinality constraint. To solve them effectively, we propose a dynamic-programming (DP) based greedy algorithm with a near-optimal performance guarantee. The DP-based greedy algorithm, however, is not very efficient because marginal gain evaluation is expensive. To speed up the algorithm, we propose an approximate greedy algorithm whose time complexity is linear in the graph size and which also has a near-optimal performance guarantee. The approximate greedy algorithm is based on carefully designed random-walk sampling and sample-materialization techniques. Extensive experiments demonstrate the effectiveness, efficiency, and scalability of the proposed algorithms.
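As a rough illustration of the first problem, the sketch below computes L-truncated hitting times with a simple dynamic program and then picks k targets greedily. It is not taken from the paper: the recurrence, the unweighted uniform-transition walk, the treatment of isolated nodes, and the names `truncated_hitting_times` and `greedy_targets` are all assumptions made for this example.

```python
def truncated_hitting_times(adj, targets, L):
    """Expected steps (capped at L) for a walk from each node to reach `targets`.

    Assumed recurrence (not from the paper): h_0(u) = 0, and for l >= 1
    h_l(u) = 0 if u is a target, else 1 + average of h_{l-1} over u's neighbors.
    """
    h = {u: 0.0 for u in adj}
    for _ in range(L):
        nxt = {}
        for u, nbrs in adj.items():
            if u in targets:
                nxt[u] = 0.0
            elif not nbrs:
                # Assumption: an isolated node behaves like a self-loop.
                nxt[u] = 1.0 + h[u]
            else:
                nxt[u] = 1.0 + sum(h[v] for v in nbrs) / len(nbrs)
        h = nxt
    return h


def greedy_targets(adj, k, L):
    """Greedily add the node that most reduces the total truncated hitting time."""
    chosen = set()
    for _ in range(k):
        candidates = [u for u in adj if u not in chosen]
        # Each candidate needs a full DP pass, which is the costly
        # marginal-gain evaluation in this naive version.
        best = min(
            candidates,
            key=lambda u: sum(truncated_hitting_times(adj, chosen | {u}, L).values()),
        )
        chosen.add(best)
    return chosen


# Hypothetical toy graph: a path 0 - 1 - 2, stored as adjacency lists.
path = {0: [1], 1: [0, 2], 2: [1]}
print(greedy_targets(path, k=1, L=2))
```

On this toy path the middle node is selected, since targeting it gives a total truncated hitting time of 2 versus 3.5 for either endpoint. Recomputing the DP for every candidate in every round is what makes this naive version slow on large graphs.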

Original language	English
Title of host publication	2014 IEEE 30th International Conference on Data Engineering, ICDE 2014
Publisher	IEEE Computer Society
Pages	736-747
Number of pages	12
ISBN (Print)	9781479925544
DOIs	https://doi.org/10.1109/ICDE.2014.6816696
Publication status	Published - 2014
Externally published	Yes
Event	30th IEEE International Conference on Data Engineering, ICDE 2014 - Chicago, IL, United States (Duration: 31 Mar 2014 → 4 Apr 2014)

Publication series

Name	Proceedings - International Conference on Data Engineering
ISSN (Print)	1084-4627

Conference

Conference	30th IEEE International Conference on Data Engineering, ICDE 2014
Country/Territory	United States
City	Chicago, IL
Period	31/03/14 → 4/04/14

Access to Document

10.1109/ICDE.2014.6816696

Cite this

@inproceedings{53280b65889d418d9f3a1d9b128f2cb2,

title = "Random-walk domination in large graphs",

abstract = "We introduce and formulate two types of random-walk domination problems in graphs motivated by a number of applications in practice (e.g., item-placement problem in online social networks, Ads-placement problem in advertisement networks, and resource-placement problem in P2P networks). Specifically, given a graph G, the goal of the first type of random-walk domination problem is to target k nodes such that the total hitting time of an L-length random walk starting from the remaining nodes to the targeted nodes is minimized. The second type of random-walk domination problem is to find k nodes to maximize the expected number of nodes that hit any one targeted node through an L-length random walk. We prove that these problems are two special instances of the submodular set function maximization with cardinality constraint problem. To solve them effectively, we propose a dynamic-programming (DP) based greedy algorithm which is with near-optimal performance guarantee. The DP-based greedy algorithm, however, is not very efficient due to the expensive marginal gain evaluation. To further speed up the algorithm, we propose an approximate greedy algorithm with linear time complexity w.r.t. the graph size and also with near-optimal performance guarantee. The approximate greedy algorithm is based on carefully designed random walk sampling and sample-materialization techniques. Extensive experiments demonstrate the effectiveness, efficiency and scalability of the proposed algorithms.",

author = "Li, {Rong Hua} and Yu, {Jeffrey Xu} and Xin Huang and Hong Cheng",

year = "2014",

doi = "10.1109/ICDE.2014.6816696",

language = "English",

isbn = "9781479925544",

series = "Proceedings - International Conference on Data Engineering",

publisher = "IEEE Computer Society",

pages = "736--747",

booktitle = "2014 IEEE 30th International Conference on Data Engineering, ICDE 2014",

address = "United States",

note = "30th IEEE International Conference on Data Engineering, ICDE 2014 ; Conference date: 31-03-2014 Through 04-04-2014",

}

Li, RH, Yu, JX, Huang, X & Cheng, H 2014, Random-walk domination in large graphs. in 2014 IEEE 30th International Conference on Data Engineering, ICDE 2014., 6816696, Proceedings - International Conference on Data Engineering, IEEE Computer Society, pp. 736-747, 30th IEEE International Conference on Data Engineering, ICDE 2014, Chicago, IL, United States, 31/03/14. https://doi.org/10.1109/ICDE.2014.6816696

Random-walk domination in large graphs. / Li, Rong Hua; Yu, Jeffrey Xu; Huang, Xin et al.
2014 IEEE 30th International Conference on Data Engineering, ICDE 2014. IEEE Computer Society, 2014. p. 736-747 6816696 (Proceedings - International Conference on Data Engineering).

TY - GEN

T1 - Random-walk domination in large graphs

AU - Li, Rong Hua

AU - Yu, Jeffrey Xu

AU - Huang, Xin

AU - Cheng, Hong

PY - 2014

Y1 - 2014

AB - We introduce and formulate two types of random-walk domination problems in graphs motivated by a number of applications in practice (e.g., item-placement problem in online social networks, Ads-placement problem in advertisement networks, and resource-placement problem in P2P networks). Specifically, given a graph G, the goal of the first type of random-walk domination problem is to target k nodes such that the total hitting time of an L-length random walk starting from the remaining nodes to the targeted nodes is minimized. The second type of random-walk domination problem is to find k nodes to maximize the expected number of nodes that hit any one targeted node through an L-length random walk. We prove that these problems are two special instances of the submodular set function maximization with cardinality constraint problem. To solve them effectively, we propose a dynamic-programming (DP) based greedy algorithm which is with near-optimal performance guarantee. The DP-based greedy algorithm, however, is not very efficient due to the expensive marginal gain evaluation. To further speed up the algorithm, we propose an approximate greedy algorithm with linear time complexity w.r.t. the graph size and also with near-optimal performance guarantee. The approximate greedy algorithm is based on carefully designed random walk sampling and sample-materialization techniques. Extensive experiments demonstrate the effectiveness, efficiency and scalability of the proposed algorithms.

UR - http://www.scopus.com/inward/record.url?scp=84901811836&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2014.6816696

DO - 10.1109/ICDE.2014.6816696

M3 - Conference contribution

AN - SCOPUS:84901811836

SN - 9781479925544

T3 - Proceedings - International Conference on Data Engineering

SP - 736

EP - 747

BT - 2014 IEEE 30th International Conference on Data Engineering, ICDE 2014

PB - IEEE Computer Society

T2 - 30th IEEE International Conference on Data Engineering, ICDE 2014

Y2 - 31 March 2014 through 4 April 2014

ER -
