DocumentCode
2191348
Title
Spatio-Temporal Symbolization of Multidimensional Time Series
Author
Hidaka, Shohei ; Yu, Chen
Author_Institution
Sch. of Knowledge Sci., Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
fYear
2010
fDate
13-13 Dec. 2010
Firstpage
249
Lastpage
256
Abstract
The present study proposes a new symbolization algorithm for multidimensional time series. We view temporal sequences as observed data generated by a dynamical system, and therefore the goal of symbolization is to estimate symbolic sequences that minimize loss of information, which is called generating partition in nonlinear physics. In order to utilize the theoretical property of symbol dynamics in data mining, our algorithm estimates symbols on multivariate time series by integrating both spatial and temporal information and selecting those dimensions in multidimensional time series containing useful information. Probabilistic symbolic sequences derived from our symbolization method can be used in various supervised and unsupervised data-mining tasks. To demonstrate this, the algorithm is evaluated by applying it to both simulated data and a real-world dataset. In both cases, the new algorithm outperforms its alternative approaches.
Keywords
data mining; symbol manipulation; time series; unsupervised learning; dynamical system; generating partition; multidimensional time series; nonlinear physics; probabilistic symbolic sequence; real world dataset; spatio temporal symbolization; symbol dynamics; temporal information; temporal sequence; theoretical property; unsupervised data mining; dimension selection; dynamical system; generating partition; heterogeneous multivariate time series; time series symbolization;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2010 IEEE International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-1-4244-9244-2
Electronic_ISBN
978-0-7695-4257-7
Type
conf
DOI
10.1109/ICDMW.2010.86
Filename
5693307
Link To Document