TY - JOUR
T1 - An Online Inference-Aided Incentive Framework for Information Elicitation Without Verification
AU - Huang, Chao
AU - Yu, Haoran
AU - Huang, Jianwei
AU - Berry, Randall
N1 - Publisher Copyright:
© 1983-2012 IEEE.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - We study the design of incentive mechanisms for the problem of information elicitation without verification (IEWV). In IEWV, a data requester seeks to design proper incentives to optimize the tradeoff between the quality of information (collected from distributed crowd workers) and the total cost of incentives (provided to crowd workers) without verifiable ground truth. While prior work often relies on sufficient knowledge of worker information, we study a scenario where the data requester cannot access workers' heterogeneous information quality and costs ex-ante. We propose a continuum-armed bandit-based incentive mechanism that dynamically learns the optimal reward level from workers' reported information. A key challenge is that the data requester cannot evaluate the workers' information quality without verification, which motivates the design of an inference algorithm. The inference problem is non-convex, yet we reformulate it as a bi-convex problem and derive an approximate solution with a performance guarantee, which ensures the effectiveness of our online reward design. We further enhance the inference algorithm using part of the workers' historical reports. We also propose a novel rule for the data requester to aggregate workers' solutions more effectively. We show that our mechanism achieves a sub-linear regret $\tilde {O}(T^{1/2})$ and outperforms several celebrated benchmarks.
AB - We study the design of incentive mechanisms for the problem of information elicitation without verification (IEWV). In IEWV, a data requester seeks to design proper incentives to optimize the tradeoff between the quality of information (collected from distributed crowd workers) and the total cost of incentives (provided to crowd workers) without verifiable ground truth. While prior work often relies on sufficient knowledge of worker information, we study a scenario where the data requester cannot access workers' heterogeneous information quality and costs ex-ante. We propose a continuum-armed bandit-based incentive mechanism that dynamically learns the optimal reward level from workers' reported information. A key challenge is that the data requester cannot evaluate the workers' information quality without verification, which motivates the design of an inference algorithm. The inference problem is non-convex, yet we reformulate it as a bi-convex problem and derive an approximate solution with a performance guarantee, which ensures the effectiveness of our online reward design. We further enhance the inference algorithm using part of the workers' historical reports. We also propose a novel rule for the data requester to aggregate workers' solutions more effectively. We show that our mechanism achieves a sub-linear regret $\tilde {O}(T^{1/2})$ and outperforms several celebrated benchmarks.
KW - Distributed learning
KW - game theory
KW - incentive mechanism design
KW - information elicitation without verification
UR - http://www.scopus.com/inward/record.url?scp=85148439630&partnerID=8YFLogxK
U2 - 10.1109/JSAC.2023.3242706
DO - 10.1109/JSAC.2023.3242706
M3 - Article
AN - SCOPUS:85148439630
SN - 0733-8716
VL - 41
SP - 1167
EP - 1185
JO - IEEE Journal on Selected Areas in Communications
JF - IEEE Journal on Selected Areas in Communications
IS - 4
ER -