• DocumentCode
    387539
  • Title

    An approach for fast subsequence matching through KMP algorithm in time series databases

  • Author

    Li, Ai-jun ; Liu, Yun-Hui ; Qi, Ying-jian ; Luo, Si-Wei

  • Author_Institution
    Dept. of Comput. Sci., Northern Jiaotong Univ., Beijing, China
  • Volume
    3
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    1292
  • Abstract
    Sequence matching in time series databases is one of the most important data mining applications. In this paper, we focus on subsequence matching. We propose an efficient approach to compare time series. To simplify the searching process, we first use the KMP algorithm to carry through rough sequence matching. As KMP is a typical algorithm for string matching, we must transform time series into 0-1 string inspired by literature (Keogh and Pazzani,1999); then we quickly. search all rough similar subsequences from major sequence and finally, to reduce the dimension of raw time series data, we use Harr wavelet transform to represent the sequence to be compared and use WT (Wavelet Transformations) coefficients to compute the similarity of two sequences. That we carry out rough matching at first may reduce the numbers of WT and quicken the whole subsequence matching process.
  • Keywords
    computational complexity; data mining; string matching; time series; wavelet transforms; Wavelet Transformations; rough sequence matching; sequence matching; similarity matching; string matching; subsequence matching; time series; time series databases; wavelet transform; Application software; Computer science; Data mining; Databases; Discrete wavelet transforms; Electronic mail; Euclidean distance; Partial response channels; Time measurement; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
  • Print_ISBN
    0-7803-7508-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2002.1167412
  • Filename
    1167412