DocumentCode :
2848293
Title :
A multiresolution symbolic representation of time series
Author :
Megalooikonomou, Vasileios ; Wang, Qiang ; Li, Guo ; Faloutsos, Christos
Author_Institution :
Dept. of Comput. & Inf. Sci., Temple Univ., Philadelphia, PA, USA
fYear :
2005
fDate :
5-8 April 2005
Firstpage :
668
Lastpage :
679
Abstract :
Efficiently and accurately searching for similarities among time series and discovering interesting patterns is an important and non-trivial problem. In this paper, we introduce a new representation of time series, the multiresolution vector quantized (MVQ) approximation, along with a new distance function. The novelty of MVQ is that it keeps both local and global information about the original time series in a hierarchical mechanism, processing the original time series at multiple resolutions. Moreover, the proposed representation is symbolic employing key subsequences and potentially allows the application of text-based retrieval techniques into the similarity analysis of time series. The proposed method is fast and scales linearly with the size of database and the dimensionality. Contrary to the vast majority in the literature that uses the Euclidean distance, MVQ uses a multi-resolution/hierarchical distance function. We performed experiments with real and synthetic data. The proposed distance function consistently outperforms all the major competitors (Euclidean, dynamic time warping, piecewise aggregate approximation) achieving up to 20% better precision/recall and clustering accuracy on the tested datasets.
Keywords :
data mining; information retrieval; statistical databases; symbol manipulation; text analysis; time series; Euclidean distance; MVQ; hierarchical distance function; multiresolution symbolic representation; multiresolution vector quantized approximation; similarity analysis; text-based retrieval techniques; time series; Aggregates; Computer science; Databases; Euclidean distance; Multiresolution analysis; Polynomials; Q measurement; Testing; Time measurement; Time series analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN :
1084-4627
Print_ISBN :
0-7695-2285-8
Type :
conf
DOI :
10.1109/ICDE.2005.10
Filename :
1410183
Link To Document :
بازگشت