CDB: Optimizing queries with crowd-based selections and joins

Guoliang Li, Chengliang Chai, Ju Fan, Xueping Weng, Jian Li, Yudian Zheng, Yuanbing Li, Xiang Yu, Xiaohang Zhang, Haitao Yuan

科研成果: 书/报告/会议事项章节会议稿件同行评审

68 引用 (Scopus)

摘要

Crowdsourcing database systems have been proposed to leverage crowd-powered operations to encapsulate the complexities of interacting with the crowd. Existing systems suffer from two major limitations. Firstly, in order to optimize a query, they often adopt the traditional tree model to select an optimized table-level join order. However, the tree model provides a coarse-grained optimization, which generates the same order for different joined tuples and limits the optimization potential that different joined tuples can be optimized by different orders. Secondly, they mainly focus on optimizing the monetary cost. In fact, there are three optimization goals (i.e., smaller monetary cost, lower latency, and higher quality) in crowdsourcing, and it calls for a system to enable multi-goal optimization. To address the limitations, we develop a crowd-powered database system CDB that supports crowd-based query optimizations, with focus on join and selection. CDB has fundamental differences from existing systems. First, CDB employs a graph-based query model that provides more fine-grained query optimization. Second, CDB adopts a unified framework to perform the multi-goal optimization based on the graph model. We have implemented our system and deployed it on AMT, CrowdFlower and ChinaCrowd. We have also created a benchmark for evaluating crowd-powered databases. We have conducted both simulated and real experiments, and the experimental results demonstrate the performance superiority of CDB on cost, latency and quality.

源语言英语
主期刊名SIGMOD 2017 - Proceedings of the 2017 ACM International Conference on Management of Data
出版商Association for Computing Machinery
1463-1478
页数16
ISBN(电子版)9781450341974
DOI
出版状态已出版 - 9 5月 2017
已对外发布
活动2017 ACM SIGMOD International Conference on Management of Data, SIGMOD 2017 - Chicago, 美国
期限: 14 5月 201719 5月 2017

出版系列

姓名Proceedings of the ACM SIGMOD International Conference on Management of Data
Part F127746
ISSN(印刷版)0730-8078

会议

会议2017 ACM SIGMOD International Conference on Management of Data, SIGMOD 2017
国家/地区美国
Chicago
时期14/05/1719/05/17

指纹

探究 'CDB: Optimizing queries with crowd-based selections and joins' 的科研主题。它们共同构成独一无二的指纹。

引用此