Word clustering based on word2vec and semantic similarity

Jie Luo*, Qinglin Wang, Yuan Li

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Citations (Scopus)

Abstract

Domain words clustering have important theoretical and practical significance in text categorization, the ontology research, machine learning and many other research areas. The domain words clustering method in this article is a method based on word2vec and semantic similarity computation. First of all, we get the candidate word set with word2vec tools to preliminary clustering of words. Then we tectonic domain category semantic core word set and screening candidate word set by means of semantic similarity computation. Finally we get new word set belongs to the target domain and get the word set in the field of clustering. Experiments show that this method has higher recall ratio and accuracy.

Original languageEnglish
Title of host publicationProceedings of the 33rd Chinese Control Conference, CCC 2014
EditorsShengyuan Xu, Qianchuan Zhao
PublisherIEEE Computer Society
Pages517-521
Number of pages5
ISBN (Electronic)9789881563842
DOIs
Publication statusPublished - 11 Sept 2014
EventProceedings of the 33rd Chinese Control Conference, CCC 2014 - Nanjing, China
Duration: 28 Jul 201430 Jul 2014

Publication series

NameProceedings of the 33rd Chinese Control Conference, CCC 2014
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

ConferenceProceedings of the 33rd Chinese Control Conference, CCC 2014
Country/TerritoryChina
CityNanjing
Period28/07/1430/07/14

Keywords

  • domain ontology
  • semantic similarity
  • word clustering
  • word2vec

Fingerprint

Dive into the research topics of 'Word clustering based on word2vec and semantic similarity'. Together they form a unique fingerprint.

Cite this