DocumentCode :
3190352
Title :
Sequential Change Detection on Data Streams
Author :
Muthukrishnan, S. ; van den Berg, Eric ; Wu, Yihua
Author_Institution :
Rutgers Univ., Piscataway
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
551
Lastpage :
550
Abstract :
Model-based declarative queries are becoming an attractive paradigm for interacting with many data stream applications. This has led to the development of techniques to accurately answer the queries using distributional models rather than raw values. The quintessential problem with this is that of detecting when there is a change in the input stream, which makes models stale and inaccurate. We adopt the sound statistical method of sequential hypothesis testing to study this problem on streams, without independence assumption. It yields algorithms that are fast, space-efficient, and oblivious to data's underlying distributions. Our experiments demonstrate the effectiveness of our methods to not only determine the existence of a change, but also the point where the change is initiated, relative to the ground truth we obtain. Our methods work seamlessly without window limitations inherent in prior work, thus have clearly shorter delays compared to alternative window-based solutions.
Keywords :
query processing; statistical testing; distributional model; model-based declarative query; sequential data stream change detection; sequential hypothesis testing; statistical method; Change detection algorithms; Conferences; Data mining; Delay effects; Delay estimation; Detection algorithms; IP networks; Monitoring; Sequential analysis; Statistical analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Data Mining Workshops, 2007. ICDM Workshops 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3019-2
Electronic_ISBN :
978-0-7695-3033-8
Type :
conf
DOI :
10.1109/ICDMW.2007.89
Filename :
4476721