A novel method for clustering web search results with Wikipedia disambiguation pages

Zhi Huang, Zhendong Niu*, Donglei Liu, Wenjuan Niu, Wei Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

Organizing the search results of an ambiguous query into topics can facilitate information search on the Web. In this paper, we propose a novel method that clusters the search results of an ambiguous query into topics about the query constructed from Wikipedia disambiguation pages (WDP). To improve the clustering result, we propose a concept filtering method that removes semantically unrelated concepts from each topic. We also propose the top K full relations (TKFR) algorithm, which assigns search results to relevant topics based on the similarities between concepts in the results and concepts in the topics. Compared with clustering methods whose topic labels are extracted from search results, the WDP topics, which are edited by humans, are much more helpful for navigation. The experimental results show that our method works for ambiguous queries of different lengths and substantially improves the clustering result of the method that uses WDP.
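The abstract does not spell out how TKFR works, so the following is only an illustrative sketch of the general idea it describes: scoring a search result against each topic by the similarities between their concepts, and assigning the result to the best-scoring topic. Everything here is an assumption made for illustration and is not taken from the paper: the function names, the word-overlap (Jaccard) similarity, the averaging of the `k` highest pairwise similarities, the threshold, and the "Jaguar" example topics.

```python
from itertools import product


def concept_similarity(a: str, b: str) -> float:
    """Stand-in similarity (an assumption): Jaccard overlap of lowercase word sets."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def top_k_score(result_concepts, topic_concepts, k=3):
    """Mean of the k highest pairwise similarities between the two concept lists."""
    sims = sorted(
        (concept_similarity(r, t) for r, t in product(result_concepts, topic_concepts)),
        reverse=True,
    )
    top = sims[:k]
    return sum(top) / len(top) if top else 0.0


def assign(result_concepts, topics, k=3, threshold=0.2):
    """Return the label of the best-scoring topic, or None if no topic exceeds the threshold."""
    best_label, best_score = None, threshold
    for label, concepts in topics.items():
        score = top_k_score(result_concepts, concepts, k)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical topics for the ambiguous query "jaguar", each described by a few concepts.
topics = {
    "Jaguar (animal)": ["big cat", "rainforest predator", "wild cat"],
    "Jaguar Cars": ["luxury car", "british car maker", "sports car"],
}

print(assign(["sports car", "car dealer"], topics))  # Jaguar Cars
print(assign(["stock price"], topics))               # None
```

A result whose concepts match no topic is left unassigned (`None`) rather than forced into the nearest topic. In practice the word-overlap similarity would be replaced by a semantic relatedness measure between concepts.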

Original language: English
Title of host publication: Database Systems for Advanced Applications - DASFAA 2015 International Workshops, SeCoP, BDMS, and Posters, Revised Selected Papers
Editors: Yoshiharu Ishikawa, Sarana Nutanong, An Liu, Tieyun Qian, Muhammad Aamir Cheema
Publisher: Springer Verlag
Pages: 3-16
Number of pages: 14
ISBN (Print): 9783319223230
Publication status: Published - 2015
Event: 2nd International Workshop on Semantic Computing and Personalization, SeCoP 2015, 2nd International Workshop on Big Data Management and Service, BDMS 2015 held in conjunction with 20th International Conference on Database Systems for Advanced Applications, DASFAA 2015 - Hanoi, Viet Nam
Duration: 20 Apr 2015 to 23 Apr 2015

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9052
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 2nd International Workshop on Semantic Computing and Personalization, SeCoP 2015, 2nd International Workshop on Big Data Management and Service, BDMS 2015 held in conjunction with 20th International Conference on Database Systems for Advanced Applications, DASFAA 2015
Country/Territory: Viet Nam
City: Hanoi
Period: 20/04/15 to 23/04/15

Keywords

  • Ambiguous query
  • Concepts filtering
  • Web search result clustering
  • Wikipedia disambiguation pages
