DVT-PKM: An improved GPU based parallel K-means algorithm

Bo Yan*, Ye Zhang, Zijiang Yang, Hongyi Su, Hong Zheng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

K-Means clustering algorithm is a typical partition-based clustering algorithm. Its two major disadvantages lie in the facts that the algorithm is sensitive to initial cluster centers and the outliers exert significant influence on the clustering results. In addition, K-Means algorithm traverses and computes all the data multiple times. Thus, the algorithm is not efficient when dealing with large data sets. In order to overcome the above limitations, this paper proposes to exclude the outliers using the minimum number of points in the d-dimensional hypersphere area. Then k cluster centers can be obtained by adjusting the threshold making use of density idea. Finally, K-Means algorithm will be integrated with Compute Unified Device Architecture (CUDA). The time efficiency is improved considerably through taking advantage of computing power of Graphic Processing Unit (GPU). We use the ratio of distance between classes to distance within classes and speedup as the evaluation criteria. The experiments indicate that the proposed algorithm significantly improves the stability and running efficiency of K-Means algorithm.

Original languageEnglish
Title of host publicationIntelligent Computing Methodologies - 10th International Conference, ICIC 2014, Proceedings
PublisherSpringer Verlag
Pages591-601
Number of pages11
ISBN (Print)9783319093383
DOIs
Publication statusPublished - 2014
Event10th International Conference on Intelligent Computing, ICIC 2014 - Taiyuan, China
Duration: 3 Aug 20146 Aug 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8589 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference10th International Conference on Intelligent Computing, ICIC 2014
Country/TerritoryChina
CityTaiyuan
Period3/08/146/08/14

Keywords

  • Graphic Processing Unit
  • K-Means
  • density

Fingerprint

Dive into the research topics of 'DVT-PKM: An improved GPU based parallel K-means algorithm'. Together they form a unique fingerprint.

Cite this