A sequential latent topic-based readability model for domain-specific information retrieval

Wenya Zhang, Dawei Song, Peng Zhang*, Xiaozhao Zhao, Yuexian Hou

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 4
  • Captures
    • Readers: 3
see details

摘要

In domain-specific information retrieval (IR), an emerging problem is how to provide different users with documents that are both relevant and readable, especially for the lay users. In this paper, we propose a novel document readability model to enhance the domain-specific IR. Our model incorporates the coverage and sequential dependency of latent topics in a document. Accordingly, two topical readability indicators, namely Topic Scope and Topic Trace are developed. These indicators, combined with the classical Surface-level indicator, can be used to rerank the initial list of documents returned by a conventional search engine. In order to extract the structured latent topics without supervision, the hierarchical Latent Dirichlet Allocation (hLDA) is used. We have evaluated our model from the user-oriented and system-oriented perspectives, in the medical domain. The user-oriented evaluation shows a good correlation between the readability scores given by our model and human judgments. Furthermore, our model also gains significant improvement in the system-oriented evaluation in comparison with one of the state-of-the-art readability methods.

源语言英语
主期刊名Information Retrieval Technology - 11th Asia Information Retrieval Societies Conference, AIRS 2015, Proceedings
编辑Falk Scholer, Guido Zuccon, Shlomo Geva, Aixin Sun, Hideo Joho, Peng Zhang
出版商Springer Verlag
241-252
页数12
ISBN(印刷版)9783319289397
DOI
出版状态已出版 - 2015
已对外发布
活动11th Asia Information Retrieval Societies Conference, AIRS 2015 - Brisbane, 澳大利亚
期限: 2 12月 20154 12月 2015

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
9460
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议11th Asia Information Retrieval Societies Conference, AIRS 2015
国家/地区澳大利亚
Brisbane
时期2/12/154/12/15

指纹

探究 'A sequential latent topic-based readability model for domain-specific information retrieval' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhang, W., Song, D., Zhang, P., Zhao, X., & Hou, Y. (2015). A sequential latent topic-based readability model for domain-specific information retrieval. 在 F. Scholer, G. Zuccon, S. Geva, A. Sun, H. Joho, & P. Zhang (编辑), Information Retrieval Technology - 11th Asia Information Retrieval Societies Conference, AIRS 2015, Proceedings (页码 241-252). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 9460). Springer Verlag. https://doi.org/10.1007/978-3-319-28940-3_19