TY - JOUR
T1 - Distance measure for symbolic approximation representation with subsequence direction for time series data mining
AU - Li, Tianyu
AU - Dong, Fang Yan
AU - Hirota, Kaoru
PY - 2013/3
Y1 - 2013/3
N2 - A distance measure is proposed for time series data mining based on symbolic aggregate approximation (SAX) with direction representation. It aims at increasing lower bound tightness to Euclidean distance and decreasing the error rate of time series data mining tasks by adding the time series subsequence direction factor to original SAX. Experiments on public University of California, Riverside (UCR) time series datasets, which contain various time series data with diverse type, length, and size, demonstrate that the tightness of the proposed distance measure increases 17.54% on average when compared with that of original SAX, and classification error rates on SAX with direction representation are reduced by 16.22% in comparison with that of results obtained by original SAX. The proposed approach lowers the classification error rate and could be applied to other time series data mining tasks, such as clustering, query by content, and motif discovery.
AB - A distance measure is proposed for time series data mining based on symbolic aggregate approximation (SAX) with direction representation. It aims at increasing lower bound tightness to Euclidean distance and decreasing the error rate of time series data mining tasks by adding the time series subsequence direction factor to original SAX. Experiments on public University of California, Riverside (UCR) time series datasets, which contain various time series data with diverse type, length, and size, demonstrate that the tightness of the proposed distance measure increases 17.54% on average when compared with that of original SAX, and classification error rates on SAX with direction representation are reduced by 16.22% in comparison with that of results obtained by original SAX. The proposed approach lowers the classification error rate and could be applied to other time series data mining tasks, such as clustering, query by content, and motif discovery.
KW - Data mining
KW - Direction representation
KW - Distance measure
KW - Symbolic aggregate approximation
KW - Time series data
UR - http://www.scopus.com/inward/record.url?scp=84879367378&partnerID=8YFLogxK
U2 - 10.20965/jaciii.2013.p0263
DO - 10.20965/jaciii.2013.p0263
M3 - Article
AN - SCOPUS:84879367378
SN - 1343-0130
VL - 17
SP - 263
EP - 271
JO - Journal of Advanced Computational Intelligence and Intelligent Informatics
JF - Journal of Advanced Computational Intelligence and Intelligent Informatics
IS - 2
ER -