TY - GEN
T1 - Optimization of an integrated model for automatic reduction and expansion of long queries
AU - Song, Dawei
AU - Shi, Yanjie
AU - Zhang, Peng
AU - Hou, Yuexian
AU - Hu, Bin
AU - Jia, Yuan
AU - Huang, Qiang
AU - Kruschwitz, Udo
AU - De Roeck, Anne
AU - Bruza, Peter
PY - 2013
Y1 - 2013
N2 - A long query provides more useful hints for searching relevant documents, but it is likely to introduce noise which affects retrieval performance. In order to smooth such adverse effect, it is important to reduce noisy terms, introduce and boost additional relevant terms. This paper presents a comprehensive framework, called Aspect Hidden Markov Model (AHMM), which integrates query reduction and expansion, for retrieval with long queries. It optimizes the probability distribution of query terms by utilizing intra-query term dependencies as well as the relationships between query terms and words observed in relevance feedback documents. Empirical evaluation on three large-scale TREC collections demonstrates that our approach, which is automatic, achieves salient improvements over various strong baselines, and also reaches a comparable performance to a state of the art method based on user's interactive query term reduction and expansion.
AB - A long query provides more useful hints for searching relevant documents, but it is likely to introduce noise which affects retrieval performance. In order to smooth such adverse effect, it is important to reduce noisy terms, introduce and boost additional relevant terms. This paper presents a comprehensive framework, called Aspect Hidden Markov Model (AHMM), which integrates query reduction and expansion, for retrieval with long queries. It optimizes the probability distribution of query terms by utilizing intra-query term dependencies as well as the relationships between query terms and words observed in relevance feedback documents. Empirical evaluation on three large-scale TREC collections demonstrates that our approach, which is automatic, achieves salient improvements over various strong baselines, and also reaches a comparable performance to a state of the art method based on user's interactive query term reduction and expansion.
UR - http://www.scopus.com/inward/record.url?scp=84893232934&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-45068-6_12
DO - 10.1007/978-3-642-45068-6_12
M3 - Conference contribution
AN - SCOPUS:84893232934
SN - 9783642450679
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 133
EP - 144
BT - Information Retrieval Technology - 9th Asia Information Retrieval Societies Conference, AIRS 2013, Proceedings
T2 - 9th Asia Information Retrieval Societies Conference on Information Retrieval Technology, AIRS 2013
Y2 - 9 December 2013 through 11 December 2013
ER -