• DocumentCode
    3133992
  • Title

    Efficient similarity search in streaming time sequences

  • Author

    Kontaki, M. ; Papadopoulos, A.N.

  • Author_Institution
    Dept. of Informatics, Aristotle Univ., Thessaloniki, Greece
  • fYear
    2004
  • fDate
    21-23 June 2004
  • Firstpage
    63
  • Lastpage
    72
  • Abstract
    Query processing in data streams is a very important research direction. The challenge in a database of data streams is to provide efficient algorithms and access methods for query processing, taking into consideration the fact that the database changes continuously as new data arrive. Traditional access methods that continuously update the data are considered inefficient, due to the significant update costs. In this paper we present IDC-Index, an efficient technique for similarity query processing in streaming time sequences, which is based on a multidimensional access method enhanced with a deferred update policy and an incremental computation of the discrete Fourier transform (DFT), which is used as a feature extraction method. The method manages to reduce the number of false alarms examined and therefore achieves high answers/candidates ratio. Moreover, an extensive performance evaluation based on synthetic random walk and real time sequences have shown that the proposed technique outperforms significantly existing approaches for similarity range query processing.
  • Keywords
    data mining; discrete Fourier transforms; feature extraction; query processing; temporal databases; time series; IDC-Index; data streams; database access; deferred update policy; discrete Fourier transform; feature extraction; multidimensional access; performance evaluation; real time sequences; similarity query processing; similarity search; streaming time sequences; synthetic random walk; Conference management; Databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Scientific and Statistical Database Management, 2004. Proceedings. 16th International Conference on
  • ISSN
    1099-3371
  • Print_ISBN
    0-7695-2146-0
  • Type

    conf

  • DOI
    10.1109/SSDM.2004.1311194
  • Filename
    1311194