Answering probabilistic top-k queries over P2P networks

Yong Jiao Sun; Ye Yuan; Guo Ren Wang

doi:10.3724/SP.J.1016.2011.02155

Answering probabilistic top-k queries over P2P networks

Yong Jiao Sun^*, Ye Yuan, Guo Ren Wang

^*Corresponding author for this work

Northeastern University China

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Top-k queries in distributed databases have been studied widely in recent years. There exists an inherent uncertainty on the data objects due to imprecise measurements and network delays. In this paper, based on horizontally distributed data among peers, we propose an efficient approach of processing uncertain top-k queries in P2P networks. Firstly, we construct a distributed index using Quad-tree, and based on the index, propose a spatial pruning algorithm. Secondly, we propose the upper bound of top-k probabilistic according to the relationship between local top-k probabilities and global top-k probabilities. We also propose the lower bound of top-k probabilities according to the relationship between skyline probabilities and top-k probabilities. Using the two probabilistic pruning algorithms, we can further reduce computation costs and network overhead of top-k queries, and further reduce the number of candidate sets. Finally, we develop a sampling algorithm to estimate top-k probabilities of candidates. Extensive experiments are conducted to verify the effectiveness and efficiency of the proposed methods.

Original language	English
Pages (from-to)	2155-2164
Number of pages	10
Journal	Jisuanji Xuebao/Chinese Journal of Computers
Volume	34
Issue number	11
DOIs	https://doi.org/10.3724/SP.J.1016.2011.02155
Publication status	Published - Nov 2011
Externally published	Yes

Keywords

P2P
Quad-tree
Skyline probability
Top-k query
Uncertain data

Access to Document

10.3724/SP.J.1016.2011.02155

Cite this

Sun, Y. J., Yuan, Y., & Wang, G. R. (2011). Answering probabilistic top-k queries over P2P networks. Jisuanji Xuebao/Chinese Journal of Computers, 34(11), 2155-2164. https://doi.org/10.3724/SP.J.1016.2011.02155

@article{a81f6c3e1b72496e99b930be2f457177,

title = "Answering probabilistic top-k queries over P2P networks",

abstract = "Top-k queries in distributed databases have been studied widely in recent years. There exists an inherent uncertainty on the data objects due to imprecise measurements and network delays. In this paper, based on horizontally distributed data among peers, we propose an efficient approach of processing uncertain top-k queries in P2P networks. Firstly, we construct a distributed index using Quad-tree, and based on the index, propose a spatial pruning algorithm. Secondly, we propose the upper bound of top-k probabilistic according to the relationship between local top-k probabilities and global top-k probabilities. We also propose the lower bound of top-k probabilities according to the relationship between skyline probabilities and top-k probabilities. Using the two probabilistic pruning algorithms, we can further reduce computation costs and network overhead of top-k queries, and further reduce the number of candidate sets. Finally, we develop a sampling algorithm to estimate top-k probabilities of candidates. Extensive experiments are conducted to verify the effectiveness and efficiency of the proposed methods.",

keywords = "P2P, Quad-tree, Skyline probability, Top-k query, Uncertain data",

author = "Sun, {Yong Jiao} and Ye Yuan and Wang, {Guo Ren}",

year = "2011",

month = nov,

doi = "10.3724/SP.J.1016.2011.02155",

language = "English",

volume = "34",

pages = "2155--2164",

journal = "Jisuanji Xuebao/Chinese Journal of Computers",

issn = "0254-4164",

publisher = "Science Press",

number = "11",

}

TY - JOUR

T1 - Answering probabilistic top-k queries over P2P networks

AU - Sun, Yong Jiao

AU - Yuan, Ye

AU - Wang, Guo Ren

PY - 2011/11

Y1 - 2011/11

N2 - Top-k queries in distributed databases have been studied widely in recent years. There exists an inherent uncertainty on the data objects due to imprecise measurements and network delays. In this paper, based on horizontally distributed data among peers, we propose an efficient approach of processing uncertain top-k queries in P2P networks. Firstly, we construct a distributed index using Quad-tree, and based on the index, propose a spatial pruning algorithm. Secondly, we propose the upper bound of top-k probabilistic according to the relationship between local top-k probabilities and global top-k probabilities. We also propose the lower bound of top-k probabilities according to the relationship between skyline probabilities and top-k probabilities. Using the two probabilistic pruning algorithms, we can further reduce computation costs and network overhead of top-k queries, and further reduce the number of candidate sets. Finally, we develop a sampling algorithm to estimate top-k probabilities of candidates. Extensive experiments are conducted to verify the effectiveness and efficiency of the proposed methods.

AB - Top-k queries in distributed databases have been studied widely in recent years. There exists an inherent uncertainty on the data objects due to imprecise measurements and network delays. In this paper, based on horizontally distributed data among peers, we propose an efficient approach of processing uncertain top-k queries in P2P networks. Firstly, we construct a distributed index using Quad-tree, and based on the index, propose a spatial pruning algorithm. Secondly, we propose the upper bound of top-k probabilistic according to the relationship between local top-k probabilities and global top-k probabilities. We also propose the lower bound of top-k probabilities according to the relationship between skyline probabilities and top-k probabilities. Using the two probabilistic pruning algorithms, we can further reduce computation costs and network overhead of top-k queries, and further reduce the number of candidate sets. Finally, we develop a sampling algorithm to estimate top-k probabilities of candidates. Extensive experiments are conducted to verify the effectiveness and efficiency of the proposed methods.

KW - P2P

KW - Quad-tree

KW - Skyline probability

KW - Top-k query

KW - Uncertain data

UR - http://www.scopus.com/inward/record.url?scp=83055167093&partnerID=8YFLogxK

U2 - 10.3724/SP.J.1016.2011.02155

DO - 10.3724/SP.J.1016.2011.02155

M3 - Article

AN - SCOPUS:83055167093

SN - 0254-4164

VL - 34

SP - 2155

EP - 2164

JO - Jisuanji Xuebao/Chinese Journal of Computers

JF - Jisuanji Xuebao/Chinese Journal of Computers

IS - 11

ER -

Answering probabilistic top-k queries over P2P networks

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this