Query intent detection based on clustering of phrase embedding

Jiahui Gu*, Chong Feng, Xiong Gao, Yashen Wang, Heyan Huang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

Understanding ambiguous or multi-faceted search queries is essential for information retrieval. The task of identifying the major aspects or senses of queries can be viewed as detection of query intents, where the intents are represented as a number of clusters. So the challenging issue in this task is how to generate intent candidates and group them semantically. This paper explores the competence of lexical statistics and embedding method. First a novel term expansion algorithm is designed to sketch all possible intent candidates. Moreover, an efficient query intent generation model is proposed, which learns latent representations for intent candidates via embedding-based methods. And then vectorized intent candidates are clustered and detected as query intents. Experimental results, based on the NTCIR-12 IMine-2 corpus, show that query intent generation model via phrase embedding significantly outperforms the state-of-art clustering algorithms in query intent detection.

源语言英语
主期刊名Social Media Processing - 5th National Conference, SMP 2016, Proceedings
编辑Hongfei Lin, Yuming Li, Guoxiong Xiang, Mingwen Wang
出版商Springer Verlag
110-122
页数13
ISBN(印刷版)9789811029929
DOI
出版状态已出版 - 2016
活动5th National Conference on Social Media Processing, SMP 2016 - Nanchang, 中国
期限: 29 10月 201630 10月 2016

出版系列

姓名Communications in Computer and Information Science
669
ISSN(印刷版)1865-0929

会议

会议5th National Conference on Social Media Processing, SMP 2016
国家/地区中国
Nanchang
时期29/10/1630/10/16

指纹

探究 'Query intent detection based on clustering of phrase embedding' 的科研主题。它们共同构成独一无二的指纹。

引用此