DVT-PKM: An improved GPU based parallel K-means algorithm

Bo Yan*, Ye Zhang, Zijiang Yang, Hongyi Su, Hong Zheng

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

7 引用 (Scopus)

摘要

K-Means clustering algorithm is a typical partition-based clustering algorithm. Its two major disadvantages lie in the facts that the algorithm is sensitive to initial cluster centers and the outliers exert significant influence on the clustering results. In addition, K-Means algorithm traverses and computes all the data multiple times. Thus, the algorithm is not efficient when dealing with large data sets. In order to overcome the above limitations, this paper proposes to exclude the outliers using the minimum number of points in the d-dimensional hypersphere area. Then k cluster centers can be obtained by adjusting the threshold making use of density idea. Finally, K-Means algorithm will be integrated with Compute Unified Device Architecture (CUDA). The time efficiency is improved considerably through taking advantage of computing power of Graphic Processing Unit (GPU). We use the ratio of distance between classes to distance within classes and speedup as the evaluation criteria. The experiments indicate that the proposed algorithm significantly improves the stability and running efficiency of K-Means algorithm.

源语言英语
主期刊名Intelligent Computing Methodologies - 10th International Conference, ICIC 2014, Proceedings
出版商Springer Verlag
591-601
页数11
ISBN(印刷版)9783319093383
DOI
出版状态已出版 - 2014
活动10th International Conference on Intelligent Computing, ICIC 2014 - Taiyuan, 中国
期限: 3 8月 20146 8月 2014

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
8589 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议10th International Conference on Intelligent Computing, ICIC 2014
国家/地区中国
Taiyuan
时期3/08/146/08/14

指纹

探究 'DVT-PKM: An improved GPU based parallel K-means algorithm' 的科研主题。它们共同构成独一无二的指纹。

引用此