A topic based document relevance ranking model

Yang Gao, Yue Xu, Yuefeng Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Topic modelling has been widely used in the fields of in- formation retrieval, text mining, machine learning, etc. In this paper, we propose a novel model, Pattern Enhanced Topic Model (PETM), which makes improvements to topic modelling by semantically representing topics with discrim- inative patterns, and also makes innovative contributions to information filtering by utilising the proposed PETM to de- Termine document relevance based on topics distribution and maximum matched patterns proposed in this paper. Exten- sive experiments are conducted to evaluate the effectiveness of PETM by using the TREC data collection Reuters Corpus Volume 1. The results show that the proposed model signifi- cantly outperforms both state-of-the-art term-based models and pattern-based models.

Original languageEnglish
Title of host publicationWWW 2014 Companion - Proceedings of the 23rd International Conference on World Wide Web
PublisherAssociation for Computing Machinery, Inc
Pages271-272
Number of pages2
ISBN (Electronic)9781450327459
DOIs
Publication statusPublished - 7 Apr 2014
Externally publishedYes
Event23rd International Conference on World Wide Web, WWW 2014 - Seoul, Korea, Republic of
Duration: 7 Apr 201411 Apr 2014

Publication series

NameWWW 2014 Companion - Proceedings of the 23rd International Conference on World Wide Web

Conference

Conference23rd International Conference on World Wide Web, WWW 2014
Country/TerritoryKorea, Republic of
CitySeoul
Period7/04/1411/04/14

Keywords

  • Pattern mining
  • Relevance ranking
  • Topic models

Fingerprint

Dive into the research topics of 'A topic based document relevance ranking model'. Together they form a unique fingerprint.

Cite this