A hierarchical clustering algorithm based on dynamic programming for categorical sequences

Jiadong Ren; Shiyuan Cao; Changzhen Hu

A hierarchical clustering algorithm based on dynamic programming for categorical sequences

Jiadong Ren^*, Shiyuan Cao, Changzhen Hu

^*此作品的通讯作者

网络空间安全学院

科研成果: 期刊稿件 › 文章 › 同行评审

4 引用（Scopus）

摘要

More and more attention has been paid to the issue of sequence mining. In this paper, a new clustering algorithm for categorical sequences is proposed. For the property that sequences have unequal length, we introduce a similarity measure for clustering of categorical and sequential attributes. The similarity measure is derived from the regular sequence alignment and is based on the idea of dynamic programming. The relative distance between element pairs is used to compute the similarity value for two sequences. The sequence similarity measure is applied in the traditional hierarchical clustering algorithm to cluster sequences. Using a splice dataset and synthetic datasets, we show the quality of clusters generated by our proposed approach and the scalability of our algorithm.

源语言	英语
页（从-至）	1575-1581
页数	7
期刊	Journal of Computational Information Systems
卷	7
期	5
出版状态	已出版 - 5月 2011

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{963b940f37b34d67a5c69811c5ae4a24,

title = "A hierarchical clustering algorithm based on dynamic programming for categorical sequences",

abstract = "More and more attention has been paid to the issue of sequence mining. In this paper, a new clustering algorithm for categorical sequences is proposed. For the property that sequences have unequal length, we introduce a similarity measure for clustering of categorical and sequential attributes. The similarity measure is derived from the regular sequence alignment and is based on the idea of dynamic programming. The relative distance between element pairs is used to compute the similarity value for two sequences. The sequence similarity measure is applied in the traditional hierarchical clustering algorithm to cluster sequences. Using a splice dataset and synthetic datasets, we show the quality of clusters generated by our proposed approach and the scalability of our algorithm.",

keywords = "Categorical sequences, Clustering, Dynamic programming",

author = "Jiadong Ren and Shiyuan Cao and Changzhen Hu",

year = "2011",

month = may,

language = "English",

volume = "7",

pages = "1575--1581",

journal = "Journal of Computational Information Systems",

issn = "1553-9105",

publisher = "Binary Information Press",

number = "5",

}

TY - JOUR

T1 - A hierarchical clustering algorithm based on dynamic programming for categorical sequences

AU - Ren, Jiadong

AU - Cao, Shiyuan

AU - Hu, Changzhen

PY - 2011/5

Y1 - 2011/5

N2 - More and more attention has been paid to the issue of sequence mining. In this paper, a new clustering algorithm for categorical sequences is proposed. For the property that sequences have unequal length, we introduce a similarity measure for clustering of categorical and sequential attributes. The similarity measure is derived from the regular sequence alignment and is based on the idea of dynamic programming. The relative distance between element pairs is used to compute the similarity value for two sequences. The sequence similarity measure is applied in the traditional hierarchical clustering algorithm to cluster sequences. Using a splice dataset and synthetic datasets, we show the quality of clusters generated by our proposed approach and the scalability of our algorithm.

AB - More and more attention has been paid to the issue of sequence mining. In this paper, a new clustering algorithm for categorical sequences is proposed. For the property that sequences have unequal length, we introduce a similarity measure for clustering of categorical and sequential attributes. The similarity measure is derived from the regular sequence alignment and is based on the idea of dynamic programming. The relative distance between element pairs is used to compute the similarity value for two sequences. The sequence similarity measure is applied in the traditional hierarchical clustering algorithm to cluster sequences. Using a splice dataset and synthetic datasets, we show the quality of clusters generated by our proposed approach and the scalability of our algorithm.

KW - Categorical sequences

KW - Clustering

KW - Dynamic programming

UR - http://www.scopus.com/inward/record.url?scp=79957656134&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:79957656134

SN - 1553-9105

VL - 7

SP - 1575

EP - 1581

JO - Journal of Computational Information Systems

JF - Journal of Computational Information Systems

IS - 5

ER -

A hierarchical clustering algorithm based on dynamic programming for categorical sequences

摘要

其它文件与链接

指纹

引用此