Title :
Mining partially periodic event patterns with unknown periods
Author :
Ma, Sheng ; Hellerstein, Joseph L.
Author_Institution :
IBM T.J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
Periodic behavior is common in real-world applications. However in many cases, periodicities are partial in that they are present only intermittently. The authors study such intermittent patterns, which they refer to as p-patterns. The formulation of p-patterns takes into account imprecise time information (e.g., due to unsynchronized clocks in distributed environments), noisy data (e.g., due to extraneous events), and shifts in phase and/or periods. We structure mining for p-patterns as two sub-tasks: (1) finding the periods of p-patterns and (2) mining temporal associations. For (2), a level-wise algorithm is used. For (1), we develop a novel approach based on a chi-squared test, and study its performance in the presence of noise. Further we develop two algorithms for mining p-patterns based on the order in which the aforementioned sub-tasks are performed: the period-first algorithm and the association-first algorithm. Our results show that the association-first algorithm has a higher tolerance to noise; the period-first algorithm is more computationally efficient and provides flexibility as to the specification of support levels. In addition, we apply the period-first algorithm to mining data collected from two production computer networks, a process that led to several actionable insights
Keywords :
computer network management; data mining; temporal logic; very large databases; association-first algorithm; chi-squared test; data mining; distributed environments; extraneous events; imprecise time information; intermittent patterns; level-wise algorithm; noisy data; p-patterns; partially periodic event pattern mining; period-first algorithm; periodic behavior; production computer networks; real-world applications; support levels; temporal association mining; unknown periods; unsynchronized clocks; Clocks; Computer networks; Data mining; Marketing and sales; Noise level; Pattern analysis; Phase noise; Security; Testing; Working environment noise;
Conference_Titel :
Data Engineering, 2001. Proceedings. 17th International Conference on
Conference_Location :
Heidelberg
Print_ISBN :
0-7695-1001-9
DOI :
10.1109/ICDE.2001.914829