Generalized bias-variance evaluation of TREC participated systems

Peng Zhang, Linxue Hao, Dawei Song, Jun Wang, Yuexian Hou, Bin Hu

科研成果: 书/报告/会议事项章节会议稿件同行评审

6 引用 (Scopus)

摘要

Recent research has shown that the improvement of mean retrieval effectiveness (e.g., MAP) may sacrifice the retrieval stability across queries, implying a tradeoff between effectiveness and stability. The evaluation of both effectiveness and stability are often based on a baseline model, which could be weak or biased. In addition, the effectiveness-stability tradeoff has not been systematically or quantitatively evaluated over TREC participated systems. The above two problems, to some extent, limit our awareness of such tradeoff and its impact on developing future IR models. In this paper, motivated by a recently proposed bias-variance based evaluation, we adopt a strong and unbiased "baseline", which is a virtual target model constructed by the best performance (for each query) among all the participated systems in a retrieval task. We also propose generalized bias-variance metrics, based on which a systematic and quantitative evaluation of the effectiveness-stability tradeoff is carried out over the participated systems in the TREC Ad-hoc Track (1993-1999) and Web Track (2010-2012). We observe a clear effectiveness-stability tradeoff, with a trend of becoming more obvious in more recent years. This implies that when we pursue more effective IR systems over years, the stability has become problematic and could have been largely overlooked.

源语言英语
主期刊名CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management
出版商Association for Computing Machinery
1911-1914
页数4
ISBN(电子版)9781450325981
DOI
出版状态已出版 - 3 11月 2014
已对外发布
活动23rd ACM International Conference on Information and Knowledge Management, CIKM 2014 - Shanghai, 中国
期限: 3 11月 20147 11月 2014

出版系列

姓名CIKM 2014 - Proceedings of the 2014 ACM International Conference on Information and Knowledge Management

会议

会议23rd ACM International Conference on Information and Knowledge Management, CIKM 2014
国家/地区中国
Shanghai
时期3/11/147/11/14

指纹

探究 'Generalized bias-variance evaluation of TREC participated systems' 的科研主题。它们共同构成独一无二的指纹。

引用此