TY - JOUR
T1 - An Improved K-Means Algorithm Based on Kurtosis Test
AU - Wang, Tingxuan
AU - Gao, Junyao
N1 - Publisher Copyright:
© Published under licence by IOP Publishing Ltd.
PY - 2019/7/17
Y1 - 2019/7/17
N2 - Clustering is a process of classifying data into different classes and has become an important tool in data mining. Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simplicity and high efficiency. However, the traditional K-means algorithm can only find spherical clusters, and is also susceptible to noise points and isolated points, which makes the clustering results affected. To solve these problems, this paper proposes an improved K-means algorithm based on kurtosis test. The improved algorithm can improve the adaptability of clustering algorithm to complex shape datasets while reducing the impact of outlier data on clustering results, so that the algorithm results can be more accurate. The method used in our study is known as kurtosis test and Monte Carlo method. We validate our theoretical results in experiments on a variety of datasets. The experimental results show that the proposed algorithm has larger external indicators of clustering performance metrics, which means that the accuracy of clustering results is significantly improved.
AB - Clustering is a process of classifying data into different classes and has become an important tool in data mining. Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simplicity and high efficiency. However, the traditional K-means algorithm can only find spherical clusters, and is also susceptible to noise points and isolated points, which makes the clustering results affected. To solve these problems, this paper proposes an improved K-means algorithm based on kurtosis test. The improved algorithm can improve the adaptability of clustering algorithm to complex shape datasets while reducing the impact of outlier data on clustering results, so that the algorithm results can be more accurate. The method used in our study is known as kurtosis test and Monte Carlo method. We validate our theoretical results in experiments on a variety of datasets. The experimental results show that the proposed algorithm has larger external indicators of clustering performance metrics, which means that the accuracy of clustering results is significantly improved.
UR - http://www.scopus.com/inward/record.url?scp=85069969298&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/1267/1/012027
DO - 10.1088/1742-6596/1267/1/012027
M3 - Conference article
AN - SCOPUS:85069969298
SN - 1742-6588
VL - 1267
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 1
M1 - 012027
T2 - 2019 3rd International Conference on Artificial Intelligence, Automation and Control Technologies, AIACT 2019
Y2 - 25 April 2019 through 27 April 2019
ER -