Leveraging pattern associations for word embedding models

Qian Liu, Heyan Huang, Yang Gao*, Xiaochi Wei, Ruiying Geng

*此作品的通讯作者

科研成果: 期刊稿件会议文章同行评审

2 引用 (Scopus)

摘要

Word embedding method has been shown powerful to capture words association, and facilitated numerous applications by effectively bridging lexical gaps. Word semantic is encoded with vectors and modeled based on n-gram language models, as a result it only takes into consideration of words co-occurrences in a shallow slide windows. However, the assumption of the language modelling ignores valuable associations between words in a long distance beyond n-gram coverage. In this paper, we argue that it is beneficial to jointly modeling both surrounding context and flexible associative patterns so that the model can cover long distance and intensive association. We propose a novel approach to combine associated patterns for word embedding method via joint training objection. We apply our model for query expansion in document retrieval task. Experimental results show that the proposed method can perform significantly better than the state-of-the-arts baseline models.

指纹

探究 'Leveraging pattern associations for word embedding models' 的科研主题。它们共同构成独一无二的指纹。

引用此