An effective sequence clustering algorithm for checking software fault feature

Jiadong Ren^*, Ruixia Yao, Changzhen Hu

^*Corresponding author for this work

School of Cyberspace Science and Technology

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Software security has become increasingly important in software applications. However, existing sequence clustering algorithms give unsatisfactory results when applied directly to the software security domain. To improve the cluster quality and the time complexity of software fault feature clustering, we propose a new similarity method and a sequence clustering algorithm called SCA (Sequence Clustering Algorithm). The number of common sequence elements contained in software fault feature sequences is calculated to measure the relationship among sequences. The similarity method also monitors the degree of normalization of fault feature sequences to obtain more accurate clustering results. Sequences are grouped into clusters according to this similarity metric. Experimental results on synthetic data show that our algorithm achieves higher cluster quality and lower time complexity.
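The abstract does not give the similarity formula or the clustering procedure, so the following is only an illustrative sketch of the general idea, not the authors' SCA. It assumes that similarity is the number of elements two sequences share (counted with multiplicity) divided by the length of the longer sequence, and that clusters are formed greedily against a threshold. The names `common_element_similarity`, `greedy_cluster`, and `threshold` are hypothetical.

```python
from collections import Counter


def common_element_similarity(a, b):
    """Assumed measure: shared element count (with multiplicity)
    divided by the length of the longer sequence."""
    if not a or not b:
        return 0.0
    shared = sum((Counter(a) & Counter(b)).values())
    return shared / max(len(a), len(b))


def greedy_cluster(sequences, threshold=0.5):
    """Assumed procedure: assign each sequence to the first cluster whose
    representative (its first member) is similar enough, else open a new one."""
    clusters = []
    for seq in sequences:
        for cluster in clusters:
            if common_element_similarity(seq, cluster[0]) >= threshold:
                cluster.append(seq)
                break
        else:
            clusters.append([seq])
    return clusters


if __name__ == "__main__":
    # Toy fault feature sequences, invented for illustration only.
    print(greedy_cluster(["abcd", "abce", "xyz", "xyw"]))
```

Dividing by the longer length is one simple way to normalize for sequence length; the paper's own normalization may differ.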

Original language	English
Pages (from-to)	824-829
Number of pages	6
Journal	Journal of Computational Information Systems
Volume	7
Issue number	3
Publication status	Published - Mar 2011

Keywords

Clustering analysis
Sequences
Similarity Measure

Cite this

@article{717d1607387b4be583b002c4cd726dac,
  title = "An effective sequence clustering algorithm for checking software fault feature",
  abstract = "Software security has become increasingly important in software applications. However, existing sequence clustering algorithms give unsatisfactory results when applied directly to the software security domain. To improve the cluster quality and the time complexity of software fault feature clustering, we propose a new similarity method and a sequence clustering algorithm called SCA (Sequence Clustering Algorithm). The number of common sequence elements contained in software fault feature sequences is calculated to measure the relationship among sequences. The similarity method also monitors the degree of normalization of fault feature sequences to obtain more accurate clustering results. Sequences are grouped into clusters according to this similarity metric. Experimental results on synthetic data show that our algorithm achieves higher cluster quality and lower time complexity.",
  keywords = "Clustering analysis, Sequences, Similarity Measure",
  author = "Jiadong Ren and Ruixia Yao and Changzhen Hu",
  year = "2011",
  month = mar,
  language = "English",
  volume = "7",
  pages = "824--829",
  journal = "Journal of Computational Information Systems",
  issn = "1553-9105",
  publisher = "Binary Information Press",
  number = "3",
}

TY - JOUR
T1 - An effective sequence clustering algorithm for checking software fault feature
AU - Ren, Jiadong
AU - Yao, Ruixia
AU - Hu, Changzhen
PY - 2011/3
Y1 - 2011/3
N2 - Software security has become increasingly important in software applications. However, existing sequence clustering algorithms give unsatisfactory results when applied directly to the software security domain. To improve the cluster quality and the time complexity of software fault feature clustering, we propose a new similarity method and a sequence clustering algorithm called SCA (Sequence Clustering Algorithm). The number of common sequence elements contained in software fault feature sequences is calculated to measure the relationship among sequences. The similarity method also monitors the degree of normalization of fault feature sequences to obtain more accurate clustering results. Sequences are grouped into clusters according to this similarity metric. Experimental results on synthetic data show that our algorithm achieves higher cluster quality and lower time complexity.
AB - Software security has become increasingly important in software applications. However, existing sequence clustering algorithms give unsatisfactory results when applied directly to the software security domain. To improve the cluster quality and the time complexity of software fault feature clustering, we propose a new similarity method and a sequence clustering algorithm called SCA (Sequence Clustering Algorithm). The number of common sequence elements contained in software fault feature sequences is calculated to measure the relationship among sequences. The similarity method also monitors the degree of normalization of fault feature sequences to obtain more accurate clustering results. Sequences are grouped into clusters according to this similarity metric. Experimental results on synthetic data show that our algorithm achieves higher cluster quality and lower time complexity.
KW - Clustering analysis
KW - Sequences
KW - Similarity Measure
UR - http://www.scopus.com/inward/record.url?scp=79953731725&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:79953731725
SN - 1553-9105
VL - 7
SP - 824
EP - 829
JO - Journal of Computational Information Systems
JF - Journal of Computational Information Systems
IS - 3
ER -
