An approach for fast subsequence matching through KMP algorithm in time series databases

Author

Li, Ai-jun ; Liu, Yun-Hui ; Qi, Ying-jian ; Luo, Si-Wei

Author_Institution

Dept. of Comput. Sci., Northern Jiaotong Univ., Beijing, China

Volume

3

fYear

2002

fDate

2002

Firstpage

1292

Abstract

Sequence matching in time series databases is one of the most important data mining applications. In this paper, we focus on subsequence matching. We propose an efficient approach to compare time series. To simplify the searching process, we first use the KMP algorithm to carry through rough sequence matching. As KMP is a typical algorithm for string matching, we must transform time series into 0-1 string inspired by literature (Keogh and Pazzani,1999); then we quickly. search all rough similar subsequences from major sequence and finally, to reduce the dimension of raw time series data, we use Harr wavelet transform to represent the sequence to be compared and use WT (Wavelet Transformations) coefficients to compute the similarity of two sequences. That we carry out rough matching at first may reduce the numbers of WT and quicken the whole subsequence matching process.

Keywords

computational complexity; data mining; string matching; time series; wavelet transforms; Wavelet Transformations; rough sequence matching; sequence matching; similarity matching; string matching; subsequence matching; time series; time series databases; wavelet transform; Application software; Computer science; Data mining; Databases; Discrete wavelet transforms; Electronic mail; Euclidean distance; Partial response channels; Time measurement; Wavelet transforms;

fLanguage

English

Publisher

ieee

Conference_Titel

Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on

Print_ISBN

0-7803-7508-4

Type

conf

DOI

10.1109/ICMLC.2002.1167412

Filename

1167412