跳到主要导航 跳到搜索 跳到主要内容

A study of per-topic variance on system comparison

  • Meng Yang
  • , Peng Zhang*
  • , Dawei Song
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Under the notion that the document collection is a sample from a population, the observed per-topic metric (e.g., AP) value varies with different samples, leading to the per-topic variance. The results of the system comparison, such as comparing the ranking of systems according to the summary metric (e.g., MAP) or testing whether there is significant difference between two systems, are affected by the variability of per-topic metric values. In this paper, we study the effect of per-topic variance on the system comparison. To measure such effects, we employ two ranking-based methods, i.e., Error Rate (ER) and Kendall Rank Correlation Coefficient (KRCC), as well as two significance test based methods, namely Achieved Significance Level (ASL) and Estimated Difference (ED). We conduct empirical comparison of TREC participated systems on Robust and Adhoc track, which shows that the effect of per-topic variance on the ranking of systems is not obvious, while the significance test based comparisons are susceptible to the per-topic variance.

源语言英语
主期刊名41st International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2018
出版商Association for Computing Machinery, Inc
1181-1184
页数4
ISBN(电子版)9781450356572
DOI
出版状态已出版 - 27 6月 2018
已对外发布
活动41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2018 - Ann Arbor, 美国
期限: 8 7月 201812 7月 2018

出版系列

姓名41st International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2018

会议

会议41st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2018
国家/地区美国
Ann Arbor
时期8/07/1812/07/18

指纹

探究 'A study of per-topic variance on system comparison' 的科研主题。它们共同构成独一无二的指纹。

引用此