TY - JOUR
T1 - An improved method for density-based clustering
AU - Jin, Hong
AU - Wang, Shuliang
AU - Zhou, Qian
AU - Li, Ying
N1 - Publisher Copyright:
Copyright © 2014 Inderscience Enterprises Ltd.
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Knowledge discovery in large multimedia databases which usually contain large amounts of noise and high-dimensional feature vectors is an increasingly important research issue. Density-based clustering is proved to be much more efficient when dealing with such databases. However, its clustering quality mainly depends on the parameter setting. For the adequate choice of the parameters to be preset, it has difficulty in its operability without enough domain knowledge. To solve such problem, in this paper it proposed a new approach to immediately inference an appropriate value for one of the parameters named bandwidth. Based on the Bayesian Theorem, it is to infer the suitable parameter value by the constructed parameter estimation model. Then the user only has to preset the other parameter noise threshold. As a result, the clusters can be identified by the determined parameter values. The experimental results show that the proposed method has complementary advantages in the density-based clustering algorithm.
AB - Knowledge discovery in large multimedia databases which usually contain large amounts of noise and high-dimensional feature vectors is an increasingly important research issue. Density-based clustering is proved to be much more efficient when dealing with such databases. However, its clustering quality mainly depends on the parameter setting. For the adequate choice of the parameters to be preset, it has difficulty in its operability without enough domain knowledge. To solve such problem, in this paper it proposed a new approach to immediately inference an appropriate value for one of the parameters named bandwidth. Based on the Bayesian Theorem, it is to infer the suitable parameter value by the constructed parameter estimation model. Then the user only has to preset the other parameter noise threshold. As a result, the clusters can be identified by the determined parameter values. The experimental results show that the proposed method has complementary advantages in the density-based clustering algorithm.
KW - Bayesian posterior probability estimation
KW - DENCLUE
KW - Density-based clustering
KW - Optimal bandwidth selection
UR - http://www.scopus.com/inward/record.url?scp=84921064769&partnerID=8YFLogxK
U2 - 10.1504/IJDMMM.2014.066763
DO - 10.1504/IJDMMM.2014.066763
M3 - Article
AN - SCOPUS:84921064769
SN - 1759-1163
VL - 6
SP - 347
EP - 368
JO - International Journal of Data Mining, Modelling and Management
JF - International Journal of Data Mining, Modelling and Management
IS - 4
ER -