DocumentCode
1396006
Title
Querying time series data based on similarity
Author
Rafiei, Davood ; Mendelzon, Alberto O.
Author_Institution
Dept. of Comput. Sci., Alberta Univ., Edmonton, Alta., Canada
Volume
12
Issue
5
fYear
2000
Firstpage
675
Lastpage
693
Abstract
We study similarity queries for time series data where similarity is defined, in a fairly general way, in terms of a distance function and a set of affine transformations on the Fourier series representation of a sequence. We identify a safe set of transformations supporting a wide variety of comparisons and show that this set is rich enough to formulate operations such as moving average and time scaling. We also show that queries expressed using safe transformations can efficiently be computed without prior knowledge of the transformations. We present a query processing algorithm that uses the underlying multidimensional index built over the data set to efficiently answer similarity queries. Our experiments show that the performance of this algorithm is competitive to that of processing ordinary (exact match) queries using the index, and much faster than sequential scanning. We propose a generalization of this algorithm for simultaneously handling multiple transformations at a time, and give experimental results on the performance of the generalized algorithm
Keywords
Fourier series; data mining; data warehouses; query processing; time series; Fourier series representation; affine transformations; distance function; moving average; performance; query processing algorithm; similarity queries; time scaling; time series data query; Data mining; Databases; Euclidean distance; Fluctuations; Fourier series; Fourier transforms; Indexing; Information retrieval; Multidimensional systems; Query processing;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.877502
Filename
877502
Link To Document