DocumentCode :
3703507
Title :
Compression rate distance measure for time series
Author :
Vo Thanh Vinh;Duong Tuan Anh
Author_Institution :
Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Viet Nam
fYear :
2015
Firstpage :
1
Lastpage :
10
Abstract :
In this work, we propose a Compression Rate Distance, a new distance measure for time series data. The main idea behind this distance is based on the Minimum Description Length (MDL) principle. The higher compression rate between two time series is, the closer they should be. Besides, we also propose a relaxed version of the new distance, called the Extended Compression Rate Distance. The Extended Compression Rate Distance can satisfy some crucial characteristics on time series such as Early Abandoning, Lower Bounding, and Relaxed-Triangular Inequality which help the new distance easily adapt with traditional indexing structures and searching methods. We tested our distances on classification problem with numerous datasets and compared the results with most of the commonly used distances in time series such as Euclidean Distance, Dynamic Time Warping, and a recently proposed Complexity-Invariant Distance. Experimental results reveal that our novel distances outperform several previous important distance measures in a vast majority of the datasets.
Keywords :
"Time series analysis","Euclidean distance","Time measurement","Complexity theory","Encoding","Cities and towns","Indexing"
Publisher :
ieee
Conference_Titel :
Data Science and Advanced Analytics (DSAA), 2015. 36678 2015. IEEE International Conference on
Print_ISBN :
978-1-4673-8272-4
Type :
conf
DOI :
10.1109/DSAA.2015.7344787
Filename :
7344787
Link To Document :
بازگشت