Cluster center initialization parallel algorithm for K-Means algorithm

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

K-Means algorithm is a one of the most famous unsupervised clustering algorithm. It has many disadvantages, such as sensitivity to the initial clustering centers and computes all the data points multiple times when facing the increasing data volume. In order to overcome the above limitations, this paper proposes to make use of density idea to find k cluster centers by adjusting the threshold. Finally, we design and implementation of the K-Means algorithm on the modern Graphic Processing Unit (GPU). The ratio of distance between classes to distance within classes and speedup are used as evaluation criteria. The experiments indicate that the proposed algorithm significantly improves the stability and efficiency of K-Means algorithm.

Original languageEnglish
Title of host publicationMaterials Science, Computer and Information Technology
PublisherTrans Tech Publications Ltd.
Pages2169-2172
Number of pages4
ISBN (Print)9783038351733
DOIs
Publication statusPublished - 2014
Event4th International Conference on Materials Science and Information Technology, MSIT 2014 - Tianjin, China
Duration: 14 Jun 201415 Jun 2014

Publication series

NameAdvanced Materials Research
Volume989-994
ISSN (Print)1022-6680
ISSN (Electronic)1662-8985

Conference

Conference4th International Conference on Materials Science and Information Technology, MSIT 2014
Country/TerritoryChina
CityTianjin
Period14/06/1415/06/14

Keywords

  • Density
  • GPU
  • K-means
  • Parallel

Fingerprint

Dive into the research topics of 'Cluster center initialization parallel algorithm for K-Means algorithm'. Together they form a unique fingerprint.

Cite this