TY - GEN
T1 - A novel method for clustering web search results with wikipedia disambiguation pages
AU - Huang, Zhi
AU - Niu, Zhendong
AU - Liu, Donglei
AU - Niu, Wenjuan
AU - Wang, Wei
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2015.
PY - 2015
Y1 - 2015
N2 - Organizing search results of an ambiguous query into topics can facilitate information search on the Web. In this paper, we propose a novel method to cluster search results of ambiguous query into topics about the query constructed from Wikipedia disambiguation pages (WDP). To improve the clustering result, we propose a concept filtering method to filter semantically unrelated concepts in each topic. Also, we propose the top K full relations (TKFR) algorithm to assign search results to relevant topics based on the similarities between concepts in the results and topics. Comparing with the clustering methods whose topic labels are extracted from search results, the topics of WDP which are edited by human are much more helpful for navigation. The experiment results show that our method can work for ambiguous queries with different query lengths and highly improves the clustering result of method using WDP.
AB - Organizing search results of an ambiguous query into topics can facilitate information search on the Web. In this paper, we propose a novel method to cluster search results of ambiguous query into topics about the query constructed from Wikipedia disambiguation pages (WDP). To improve the clustering result, we propose a concept filtering method to filter semantically unrelated concepts in each topic. Also, we propose the top K full relations (TKFR) algorithm to assign search results to relevant topics based on the similarities between concepts in the results and topics. Comparing with the clustering methods whose topic labels are extracted from search results, the topics of WDP which are edited by human are much more helpful for navigation. The experiment results show that our method can work for ambiguous queries with different query lengths and highly improves the clustering result of method using WDP.
KW - Ambiguous query
KW - Concepts filtering
KW - Web search result clustering
KW - Wikipedia disambiguation pages
UR - http://www.scopus.com/inward/record.url?scp=84949982943&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-22324-7_1
DO - 10.1007/978-3-319-22324-7_1
M3 - Conference contribution
AN - SCOPUS:84949982943
SN - 9783319223230
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 3
EP - 16
BT - Database Systems for Advanced Applications - DASFAA 2015 International Workshops, SeCoP, BDMS, and Posters, Revised Selected Papers
A2 - Ishikawa, Yoshiharu
A2 - Nutanong, Sarana
A2 - Liu, An
A2 - Qian, Tieyun
A2 - Cheema, Muhammad Aamir
PB - Springer Verlag
T2 - 2nd International Workshop on Semantic Computing and Personalization, SeCoP 2015, 2nd International Workshop on Big Data Management and Service, BDMS 2015 held in conjunction with 20th International Conference on Database Systems for Advanced Applications, DASFAA 2015
Y2 - 20 April 2015 through 23 April 2015
ER -