DocumentCode :
2984660
Title :
Efficient Episode Mining of Dynamic Event Streams
Author :
Patnaik, Debprakash ; Laxman, Srivatsan ; Chandramouli, B. ; Ramakrishnan, N.
Author_Institution :
Amazon.com, Seattle, WA, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
605
Lastpage :
614
Abstract :
Discovering frequent episodes over event sequences is an important data mining problem. Existing methods typically require multiple passes over the data, rendering them unsuitable for streaming contexts. We present the first streaming algorithm for mining frequent episodes over a window of recent events in the stream. We derive approximation guarantees for our algorithm in terms of: (i) the separation of frequent episodes from infrequent ones, and (ii) the rate of change of stream characteristics. Our parameterization of the problem provides a new sweet spot in the tradeoff between making distributional assumptions over the stream and algorithmic efficiencies of mining. We illustrate how this yields significant benefits when mining practical streams from neuroscience and telecommunications logs.
Keywords :
data mining; data mining; dynamic event streams; efficient episode mining; event sequences; neuroscience; telecommunications logs; Algorithm design and analysis; Approximation algorithms; Approximation methods; Data mining; Electronic mail; Frequency shift keying; Photonic band gap; Approximation Algorithms; Data Streams; Event Sequences; Frequent Episodes; Pattern Discovery; Streaming Algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.84
Filename :
6413866
Link To Document :
بازگشت