Incentive-based entity collection using crowdsourcing

Chengliang Chai, Ju Fan*, Guoliang Li

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

25 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 25
  • Captures
    • Readers: 14
see details

摘要

Crowdsourced entity collection leverages human's ability to collect entities that are missing in a database, which has many real-world applications, such as knowledge base enrichment and enterprise data collection. There are several challenges. First, it is hard to evaluate the workers' quality because a worker's quality depends on not only the correctness of her provided entities but also the distinctness of these entities compared with the collected ones by other workers. Second, crowd workers are likely to provide popular entities and different workers will provide many duplicated entities, leading to a waste of money and low coverage. To address these challenges, we propose an incentive-based crowdsourced entity collection framework CrowdEC that encourages workers to provide more distinct items using an incentive strategy. CrowdEC has fundamental differences from existing crowdsourcing collection methods. One the one hand, CrowdEC proposes a worker model and evaluates a worker's quality based on cross validation and entity checking. CrowdEC devises a worker utility model that considers both worker's quality and entities' distinctness provided by workers. CrowdEC proposes a worker elimination method to block workers with a low utility, which solves the first challenge. On the other hand, CrowdEC proposes an incentive pricing technique that encourages each qualified (i.e., non-eliminated) worker to provide distinct entities rather than duplicates. CrowdEC provides two types of tasks and judiciously assigns workers with appropriate tasks to address the second challenge. We have conducted both real and simulated experiments, and the results show that CrowdEC outperforms existing state-of-The-Art works on both cost and quality.

源语言英语
主期刊名Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
出版商Institute of Electrical and Electronics Engineers Inc.
341-352
页数12
ISBN(电子版)9781538655207
DOI
出版状态已出版 - 24 10月 2018
已对外发布
活动34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, 法国
期限: 16 4月 201819 4月 2018

出版系列

姓名Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018

会议

会议34th IEEE International Conference on Data Engineering, ICDE 2018
国家/地区法国
Paris
时期16/04/1819/04/18

指纹

探究 'Incentive-based entity collection using crowdsourcing' 的科研主题。它们共同构成独一无二的指纹。

引用此

Chai, C., Fan, J., & Li, G. (2018). Incentive-based entity collection using crowdsourcing. 在 Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 (页码 341-352). 文章 8509260 (Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDE.2018.00039