TY - GEN
T1 - Integrating document features for entity ranking
AU - Zhu, Jianhan
AU - Song, Dawei
AU - Rüger, Stefan
PY - 2008
Y1 - 2008
N2 - The Knowledge Media Institute of the Open University participated in the entity ranking and entity list completion tasks of the Entity Ranking Track in INEX 2007. In both the entity ranking and entity list completion tasks, we have considered document features in addition to a basic document content based relevance model. These document features include categorizations of documents, relevance of category names to the query, and hierarchical relations between categories. Furthermore, based on our TREC2006 and 2007 expert search approach, we applied a co-occurrence based entity association discovery model to the two tasks based on the assumption that relevant entities often co-occur with query terms or given relevant entities in documents. Our initial experimental results show that, by considering the predefined category, its children and grandchildren in the document content based relevance model, the performance of our entity ranking approach can be significantly improved. Consideration of the predefined category's parents, a category name based relevance model, and the co-occurrence model is not shown to be helpful in entity ranking and list completion, respectively.
AB - The Knowledge Media Institute of the Open University participated in the entity ranking and entity list completion tasks of the Entity Ranking Track in INEX 2007. In both the entity ranking and entity list completion tasks, we have considered document features in addition to a basic document content based relevance model. These document features include categorizations of documents, relevance of category names to the query, and hierarchical relations between categories. Furthermore, based on our TREC2006 and 2007 expert search approach, we applied a co-occurrence based entity association discovery model to the two tasks based on the assumption that relevant entities often co-occur with query terms or given relevant entities in documents. Our initial experimental results show that, by considering the predefined category, its children and grandchildren in the document content based relevance model, the performance of our entity ranking approach can be significantly improved. Consideration of the predefined category's parents, a category name based relevance model, and the co-occurrence model is not shown to be helpful in entity ranking and list completion, respectively.
KW - Categories
KW - Entity ranking
KW - Entity retrieval
KW - List completion
UR - http://www.scopus.com/inward/record.url?scp=51849126581&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-85902-4_29
DO - 10.1007/978-3-540-85902-4_29
M3 - Conference contribution
AN - SCOPUS:51849126581
SN - 3540859012
SN - 9783540859017
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 336
EP - 347
BT - Focused Access to XML Documents - 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Revised and Selected Papers
T2 - 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007
Y2 - 17 December 2007 through 19 December 2007
ER -