DocumentCode :
3104921
Title :
STAGGER: Periodicity Mining of Data Streams Using Expanding Sliding Windows
Author :
Elfeky, Mohamed G. ; Aref, Walid G. ; Elmagarmid, Ahmed K.
Author_Institution :
Google Inc., Mountain View, CA
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
188
Lastpage :
199
Abstract :
Sensor devices are becoming ubiquitous, especially in measurement and monitoring applications. Because of the real-time, append-only and semi-infinite natures of the generated sensor data streams, an online incremental approach is a necessity for mining stream data types. In this paper, we propose STAGGER: a one-pass, online and incremental algorithm for mining periodic patterns in data streams. STAGGER does not require that the user pre-specify the periodicity rate of the data. Instead, STAGGER discovers the potential periodicity rates. STAGGER maintains multiple expanding sliding windows staggered over the stream, where computations are shared among the multiple overlapping windows. Small-length sliding windows are imperative for early and real-time output, yet are limited to discover short periodicity rates. As streamed data arrives continuously, the sliding windows expand in length in order to cover the whole stream. Larger-length sliding windows are able to discover longer periodicity rates. STAGGER incrementally maintains a tree-like data structure for the frequent periodic patterns of each discovered potential periodicity rate. In contrast to the Fourier/Wavelet-based approaches used for discovering periodicity rates, STAGGER not only discovers a wider, more accurate set of periodicities, but also discovers the periodic patterns themselves. In fact, experimental results with real and synthetic data sets show that STAGGER outperforms Fourier/Wavelet-based approaches by an order of magnitude in terms of the accuracy of the discovered periodicity rates. Moreover, real-data experiments demonstrate the practicality of the discovered periodic patterns.
Keywords :
data mining; tree data structures; STAGGER; data streams mining; expanding sliding windows; online incremental approach; periodicity mining; periodicity rates discovering; sensor devices; tree-like data structure; Computerized monitoring; Data mining; Frequency; History; Hysteresis; Inspection; Pervasive computing; Telephony; Tree data structures; Windows;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.153
Filename :
4053047
Link To Document :
بازگشت