Title :
Space-Time Summarization of Multisensor Time Series. Case of Missing Data
Author :
Joliveau, Marc ; De Vuyst, F.
Author_Institution :
Ecole Centrale Paris, Paris
Abstract :
A wide variety of application domains have to deal with incomplete data sets. In particular, data from sensor networks are often incomplete due to factors such as partial system failures or poor measurement conditions. With such massive, incomplete spatio-temporal data sets, it becomes hard in practice to manipulate the data and to extract knowledge. In this paper, we use the so-called Space-Time Principal Component Analysis (STPCA) as a tool to propose a representation of the data set, free of missing values and in a reduced dimension, on which we can apply data mining and knowledge extraction algorithms. The effectiveness of the proposed method is demonstrated on a real vehicle traffic data set containing about 15 million measurements with an incompleteness rate on the order of 20% or more. Experiments show that the method behaves well and is strongly robust when computing a representation of the data, summarizing them, and preserving their inherent information.
Keywords :
data mining; data structures; principal component analysis; sensor fusion; time series; data set representation; incomplete massive spatio-temporal data sets; knowledge extraction; missing data; multisensor time series; sensors networks; space-time principal component analysis; space-time summarization; Conferences; Geographic Information Systems; Particle measurements; Robustness; Sensor systems; Telecommunication traffic; Time varying systems; Vehicles
Conference_Title :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
DOI :
10.1109/ICDMW.2007.76