CrowdOp: Query Optimization for Declarative Crowdsourcing Systems

Ju Fan, Meihui Zhang, Stanley Kok, Meiyu Lu, Beng Chin Ooi

Research output: Contribution to journalArticlepeer-review

45 Citations (Scopus)

Abstract

We study the query optimization problem in declarative crowdsourcing systems. Declarative crowdsourcing is designed to hide the complexities and relieve the user of the burden of dealing with the crowd. The user is only required to submit an SQL-like query and the system takes the responsibility of compiling the query, generating the execution plan and evaluating in the crowdsourcing marketplace. A given query can have many alternative execution plans and the difference in crowdsourcing cost between the best and the worst plans may be several orders of magnitude. Therefore, as in relational database systems, query optimization is important to crowdsourcing systems that provide declarative query interfaces. In this paper, we propose CrowdOp, a cost-based query optimization approach for declarative crowdsourcing systems. CrowdOp considers both cost and latency in query optimization objectives and generates query plans that provide a good balance between the cost and latency. We develop efficient algorithms in the CrowdOp for optimizing three types of queries: selection queries, join queries, and complex selection-join queries. We validate our approach via extensive experiments by simulation as well as with the real crowd on Amazon Mechanical Turk.

Original languageEnglish
Article number7052378
Pages (from-to)2078-2092
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume27
Issue number8
DOIs
Publication statusPublished - 1 Aug 2015
Externally publishedYes

Keywords

  • Crowdsourcing
  • Query Optimization

Fingerprint

Dive into the research topics of 'CrowdOp: Query Optimization for Declarative Crowdsourcing Systems'. Together they form a unique fingerprint.

Cite this