A study of collection-based features for adapting the balance parameter in pseudo relevance feedback

Ye Meng, Peng Zhang, Dawei Song*, Yuexian Hou

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Pseudo-relevance feedback (PRF) is an effective technique to improve the ad-hoc retrieval performance. For PRF methods, how to optimize the balance parameter between the original query model and feedback model is an important but difficult problem. Traditionally, the balance parameter is often manually tested and set to a fixed value across collections and queries. However, due to the difference among collections and individual queries, this parameter should be tuned differently. Recent research has studied various query based and feedback documents based features to predict the optimal balance parameter for each query on a specific collection, through a learning approach based on logistic regression. In this paper, we hypothesize that characteristics of collections are also important for the prediction. We propose and systematically investigate a series of collection-based features for queries, feedback documents and candidate expansion terms. The experiments show that our method is competitive in improving retrieval performance and particularly for cross-collection prediction, in comparison with the state-of-the-art approaches.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 11th Asia Information Retrieval Societies Conference, AIRS 2015, Proceedings
EditorsFalk Scholer, Guido Zuccon, Shlomo Geva, Aixin Sun, Hideo Joho, Peng Zhang
PublisherSpringer Verlag
Pages265-276
Number of pages12
ISBN (Print)9783319289397
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event11th Asia Information Retrieval Societies Conference, AIRS 2015 - Brisbane, Australia
Duration: 2 Dec 20154 Dec 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9460
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference11th Asia Information Retrieval Societies Conference, AIRS 2015
Country/TerritoryAustralia
CityBrisbane
Period2/12/154/12/15

Keywords

  • Collection characteristics
  • Information retrieval
  • Pseudo-relevance feedback

Fingerprint

Dive into the research topics of 'A study of collection-based features for adapting the balance parameter in pseudo relevance feedback'. Together they form a unique fingerprint.

Cite this