A study of document weight smoothness in pseudo relevance feedback

Peng Zhang*, Dawei Song, Xiaochao Zhao, Yuexian Hou

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

In pseudo relevance feedback (PRF), the document weight which indicates how important a document is for the PRF model, plays a key role. In this paper, we investigate the smoothness issue of the document weights in PRF. The term smoothness means that the document weights decrease smoothly (i.e. gradually) along the document ranking list, and the weights are smooth (i.e. similar) within topically similar documents. We postulate that a reasonably smooth document-weighting function can benefit the PRF performance. This hypothesis is tested under a typical PRF model, namely the Relevance Model (RM). We propose a two-step document weight smoothing method, the different instantiations of which have different effects on weight smoothing. Experiments on three TREC collections show that the instantiated methods with better smoothing effects generally lead to better PRF performance. In addition, the proposed method can significantly improve the RM's performance and outperform various alternative methods which can also be used to smooth the document weights.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 6th Asia Information Retrieval Societies Conference, AIRS 2010, Proceedings
Pages527-538
Number of pages12
DOIs
Publication statusPublished - 2010
Externally publishedYes
Event6th Asia Information Retrieval Societies Conference, AIRS 2010 - Taipei, Taiwan, Province of China
Duration: 1 Dec 20103 Dec 2010

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume6458 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Asia Information Retrieval Societies Conference, AIRS 2010
Country/TerritoryTaiwan, Province of China
CityTaipei
Period1/12/103/12/10

Keywords

  • Document weight smoothness
  • Pseudo relevance feedback
  • Query language model
  • Relevance Model

Fingerprint

Dive into the research topics of 'A study of document weight smoothness in pseudo relevance feedback'. Together they form a unique fingerprint.

Cite this