A hierarchical clustering algorithm based on dynamic programming for categorical sequences

Jiadong Ren*, Shiyuan Cao, Changzhen Hu

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

More and more attention has been paid to the issue of sequence mining. In this paper, a new clustering algorithm for categorical sequences is proposed. For the property that sequences have unequal length, we introduce a similarity measure for clustering of categorical and sequential attributes. The similarity measure is derived from the regular sequence alignment and is based on the idea of dynamic programming. The relative distance between element pairs is used to compute the similarity value for two sequences. The sequence similarity measure is applied in the traditional hierarchical clustering algorithm to cluster sequences. Using a splice dataset and synthetic datasets, we show the quality of clusters generated by our proposed approach and the scalability of our algorithm.

源语言英语
页(从-至)1575-1581
页数7
期刊Journal of Computational Information Systems
7
5
出版状态已出版 - 5月 2011

指纹

探究 'A hierarchical clustering algorithm based on dynamic programming for categorical sequences' 的科研主题。它们共同构成独一无二的指纹。

引用此