Title :
Minimization of Suffix Array´s Storage Capacity for Periodicity Detection in Time Series
Author :
Xylogiannopoulos, K.F. ; Karampelas, P. ; Alhajj, Reda
Author_Institution :
Dept. of Inf. Technol., Hellenic American Univ., Manchester, NH, USA
Abstract :
In everyday life bulk amount of time-stamped data is accumulated in diverse databases. Such data may be mapped into a time-based representation forming very long time series which could be effectively analyzed for valuable knowledge discovery. However, most of the times analyzing these time series has been proven a very complicated task especially when they are very large. This paper tackles the problem by proposing an optimization method for storing very large time series in suffix arrays for further analysis, and repeated pattern detection is proposed as well. Based on this method, the required part of the time series to be stored for repeated pattern detection can be reduced by at least 25%. The method was applied to DNA chains with length up to 100,000,000 characters long and the corresponding results are presented.
Keywords :
data mining; object detection; optimisation; storage management; time series; DNA chains; data mapping; diverse database; knowledge discovery; optimization method; pattern detection; periodicity detection; storage capacity; suffix array; time series; time-based representation; time-stamped data accumulation; Arrays; DNA; Data mining; Database systems; Time series analysis; DNA analysis; data mining; periodicity detection; space capacity requirement; suffix arrays; time series;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2012 IEEE 24th International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4799-0227-9
DOI :
10.1109/ICTAI.2012.49