Title :
Time Series Classification Using Compression Distance of Recurrence Plots
Author :
Silva, Diego F. ; De Souza, Vinicius M. A. ; Batista, Gustavo E. A. P. A.
Author_Institution :
Inst. de Cienc. Mat. e de Comput., Univ. de Sao Paulo, Sao Carlos, Brazil
Abstract :
There is a huge increase of interest for time series methods and techniques. Virtually every piece of information collected from human, natural, and biological processes is susceptible to changes over time, and the study of how these changes occur is a central issue in fully understanding such processes. Among all time series mining tasks, classification is likely to be the most prominent one. In time series classification there is a significant body of empirical research that indicates that k-nearest neighbor rule in the time domain is very effective. However, certain time series features are not easily identified in this domain and a change in representation may reveal some significant and unknown features. In this work, we propose the use of recurrence plots as representation domain for time series classification. Our approach measures the similarity between recurrence plots using Campana-Keogh (CK-1) distance, a Kolmogorov complexity-based distance that uses video compression algorithms to estimate image similarity. We show that recurrence plots allied to CK-1 distance lead to significant improvements in accuracy rates compared to Euclidean distance and Dynamic Time Warping in several data sets. Although recurrence plots cannot provide the best accuracy rates for all data sets, we demonstrate that we can predict ahead of time that our method will outperform the time representation with Euclidean and Dynamic Time Warping distances.
Keywords :
data compression; data mining; image classification; time series; video coding; CK-1 distance; Campana-Keogh distance; Euclidean distance; Kolmogorov complexity-based distance; dynamic time warping; image similarity estimation; k-nearest neighbor rule; recurrence plot compression distance; similarity measurement; time series classification; time series mining task; video compression algorithms; Accuracy; Complexity theory; Equations; Euclidean distance; Mathematical model; Time series analysis; Training; Time series; classification; dstance measure; recurrence plot;
Conference_Titel :
Data Mining (ICDM), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
DOI :
10.1109/ICDM.2013.128