DocumentCode :
3013467
Title :
An index-based approach for similarity search supporting time warping in large sequence databases
Author :
Kim, Sang-Wook ; Park, Sanghyun ; Chu, Wesley W.
Author_Institution :
Dept. of Comput., Inf. & Commun. Eng., Kangwon Nat. Univ., Chunchon, South Korea
fYear :
2001
fDate :
2001
Firstpage :
607
Lastpage :
614
Abstract :
This paper proposes a new novel method for similarity search that supports time warping in large sequence databases. Time warping enables finding sequences with similar patterns even when they are of different lengths. Previous methods for processing similarity search that supports time warping fail to employ multi-dimensional indexes without false dismissal since the time warping distance does not satisfy the triangular inequality. Our primary goal is to innovate on search performance without permitting any false dismissal. To attain this goal, we devise a new distance function Dtw-lb that consistently underestimates the time warping distance and also satisfies the triangular inequality Dtw-lb uses a 4-tuple feature vector that is extracted from each sequence and is invariant to time warping. For efficient processing of similarity search, we employ a multi-dimensional index that uses the 4-tuple feature vector as indexing attributes and Dtw-lb as a distance function. The extensive experimental results reveal that our method achieves significant speedup up to 43 times with real-world S&P 500 stock data and up to 720 times with very large synthetic data
Keywords :
database indexing; query processing; very large databases; distance function; experimental results; feature vector; large sequence databases; multidimensional indexes; similarity search; stock data; time warping; triangular inequality; Computer science; Data engineering; Data mining; Databases; Euclidean distance; Exchange rates; Indexing; Length measurement; Marketing and sales; Temperature;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2001. Proceedings. 17th International Conference on
Conference_Location :
Heidelberg
ISSN :
1063-6382
Print_ISBN :
0-7695-1001-9
Type :
conf
DOI :
10.1109/ICDE.2001.914875
Filename :
914875
Link To Document :
بازگشت