Optimization of an integrated model for automatic reduction and expansion of long queries

Dawei Song, Yanjie Shi, Peng Zhang, Yuexian Hou, Bin Hu, Yuan Jia, Qiang Huang, Udo Kruschwitz, Anne De Roeck, Peter Bruza

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

A long query provides more useful hints for searching relevant documents, but it is likely to introduce noise which affects retrieval performance. In order to smooth such adverse effect, it is important to reduce noisy terms, introduce and boost additional relevant terms. This paper presents a comprehensive framework, called Aspect Hidden Markov Model (AHMM), which integrates query reduction and expansion, for retrieval with long queries. It optimizes the probability distribution of query terms by utilizing intra-query term dependencies as well as the relationships between query terms and words observed in relevance feedback documents. Empirical evaluation on three large-scale TREC collections demonstrates that our approach, which is automatic, achieves salient improvements over various strong baselines, and also reaches a comparable performance to a state of the art method based on user's interactive query term reduction and expansion.

Original languageEnglish
Title of host publicationInformation Retrieval Technology - 9th Asia Information Retrieval Societies Conference, AIRS 2013, Proceedings
Pages133-144
Number of pages12
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event9th Asia Information Retrieval Societies Conference on Information Retrieval Technology, AIRS 2013 - Singapore, Singapore
Duration: 9 Dec 201311 Dec 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8281 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th Asia Information Retrieval Societies Conference on Information Retrieval Technology, AIRS 2013
Country/TerritorySingapore
CitySingapore
Period9/12/1311/12/13

Fingerprint

Dive into the research topics of 'Optimization of an integrated model for automatic reduction and expansion of long queries'. Together they form a unique fingerprint.

Cite this