Document re-ranking by generality in bio-medical information retrieval

Xin Yan*, Xue Li, Dawei Song

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Document ranking is an important process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. Traditional document ranking methods are mostly based on the similarity computations between documents and query. In this paper we argue that the similarity-based document ranking is insufficient in some cases. There are two reasons. Firstly it is about the increased information variety. There are far too many different types documents available now for user to search. The second is about the users variety. In many cases user may want to retrieve documents that are not only similar but also general or broad regarding a certain topic. This is particularly the case in some domains such as bio-medical IR. In this paper we propose a novel approach to re-rank the retrieved documents by incorporating the similarity with their generality. By an ontology-based analysis on the semantic cohesion of text, document generality can be quantified. The retrieved documents are then re-ranked by their combined scores of similarity and the closeness of documents' generality to the query's. Our experiments have shown an encouraging performance on a large bio-medical document collection, OHSUMED, containing 348,566 medical journal references and 101 test queries.

源语言英语
主期刊名Web Information Systems Engineering, WISE 2005 - 6th International Conference on Web Information Systems Engineering, Proceedings
出版商Springer Verlag
376-389
页数14
ISBN(印刷版)3540300171, 9783540300175
DOI
出版状态已出版 - 2005
已对外发布
活动6th International Conference on Web Information Systems Engineering, WISE 2005 - New York, NY, 美国
期限: 20 11月 200522 11月 2005

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
3806 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议6th International Conference on Web Information Systems Engineering, WISE 2005
国家/地区美国
New York, NY
时期20/11/0522/11/05

指纹

探究 'Document re-ranking by generality in bio-medical information retrieval' 的科研主题。它们共同构成独一无二的指纹。

引用此