基于词向量的中文事件发现及表示

Bin Zhang, Linmei Hu, Lei Hou*, Juanzi Li

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

Existing methods of event detection are mainly based on traditional TF-IDF document representation with high dimension and sparse semantics, leading to low efficiency and accuracy. Thus, they are not suitable for large-scale online news event detection. A document representation method based on word embedding is proposed in this paper. By the document representation method, the document representation dimension is reduced, the semantic sparse problem is alleviated and the efficiency and accuracy of document similarity calculation are enhanced. Based on the document representation method, a dynamic online clustering method is proposed for online news event detection. Based on the dynamic online clustering method, both the accuracy and the recall of event detection are improved. Experiments on the standard dataset TDT4 and a real dataset show that the proposed adaptive online event detection method significantly improves the performance of event detection in both efficiency and accuracy compared with the state-of-the-art methods.

投稿的翻译标题Word Embedding Based Chinese News Event Detection and Representation
源语言繁体中文
页(从-至)275-282
页数8
期刊Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence
31
3
DOI
出版状态已出版 - 1 3月 2018
已对外发布

关键词

  • Dynamic Online Clustering
  • Event Detection
  • Word Embedding

指纹

探究 '基于词向量的中文事件发现及表示' 的科研主题。它们共同构成独一无二的指纹。

引用此