• DocumentCode
    2848293
  • Title

    A multiresolution symbolic representation of time series

  • Author

    Megalooikonomou, Vasileios ; Wang, Qiang ; Li, Guo ; Faloutsos, Christos

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Temple Univ., Philadelphia, PA, USA
  • fYear
    2005
  • fDate
    5-8 April 2005
  • Firstpage
    668
  • Lastpage
    679
  • Abstract
    Efficiently and accurately searching for similarities among time series and discovering interesting patterns is an important and non-trivial problem. In this paper, we introduce a new representation of time series, the multiresolution vector quantized (MVQ) approximation, along with a new distance function. The novelty of MVQ is that it keeps both local and global information about the original time series in a hierarchical mechanism, processing the original time series at multiple resolutions. Moreover, the proposed representation is symbolic employing key subsequences and potentially allows the application of text-based retrieval techniques into the similarity analysis of time series. The proposed method is fast and scales linearly with the size of database and the dimensionality. Contrary to the vast majority in the literature that uses the Euclidean distance, MVQ uses a multi-resolution/hierarchical distance function. We performed experiments with real and synthetic data. The proposed distance function consistently outperforms all the major competitors (Euclidean, dynamic time warping, piecewise aggregate approximation) achieving up to 20% better precision/recall and clustering accuracy on the tested datasets.
  • Keywords
    data mining; information retrieval; statistical databases; symbol manipulation; text analysis; time series; Euclidean distance; MVQ; hierarchical distance function; multiresolution symbolic representation; multiresolution vector quantized approximation; similarity analysis; text-based retrieval techniques; time series; Aggregates; Computer science; Databases; Euclidean distance; Multiresolution analysis; Polynomials; Q measurement; Testing; Time measurement; Time series analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
  • ISSN
    1084-4627
  • Print_ISBN
    0-7695-2285-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.10
  • Filename
    1410183