Crowdsourcing database systems: Overview and challenges

Chengliang Chai, Ju Fan, Guoliang Li, Jiannan Wang, Yudian Zheng

科研成果: 书/报告/会议事项章节会议稿件同行评审

30 引用 (Scopus)

摘要

Many data management and analytics tasks, such as entity resolution, cannot be solely addressed by automated processes. Crowdsourcing is an effective way to harness the human cognitive ability to process these computer-hard tasks. Thanks to public crowdsourcing platforms, e.g., Amazon Mechanical Turk and CrowdFlower, we can easily involve hundreds of thousands of ordinary workers (i.e., the crowd) to address these computer-hard tasks. However it is rather inconvenient to interact with the crowdsourcing platforms, because the platforms require one to set parameters and even write codes. Inspired by traditional DBMS, crowdsourcing database systems have been proposed and widely studied to encapsulate the complexities of interacting with the crowd. In this tutorial, we will survey and synthesize a wide spectrum of existing studies on crowdsourcing database systems. We first give an overview of crowdsourcing, and then summarize the fundamental techniques in designing crowdsourcing databases, including task design, truth inference, task assignment, answer reasoning and latency reduction. Next we review the techniques on designing crowdsourced operators, including selection, join, sort, top-k, max/min, count, collect, and fill. Finally, we discuss the emerging challenges.

源语言英语
主期刊名Proceedings - 2019 IEEE 35th International Conference on Data Engineering, ICDE 2019
出版商IEEE Computer Society
2052-2055
页数4
ISBN(电子版)9781538674741
DOI
出版状态已出版 - 4月 2019
已对外发布
活动35th IEEE International Conference on Data Engineering, ICDE 2019 - Macau, 中国
期限: 8 4月 201911 4月 2019

出版系列

姓名Proceedings - International Conference on Data Engineering
2019-April
ISSN(印刷版)1084-4627

会议

会议35th IEEE International Conference on Data Engineering, ICDE 2019
国家/地区中国
Macau
时期8/04/1911/04/19

指纹

探究 'Crowdsourcing database systems: Overview and challenges' 的科研主题。它们共同构成独一无二的指纹。

引用此