Title :
Verifying and Mining Frequent Patterns from Large Windows over Data Streams
Author :
Mozafari, Barzan ; Thakkar, Hetal ; Zaniolo, Carlo
Author_Institution :
Comput. Sci. Dept., Univ. of California, Los Angeles, CA
Abstract :
Mining frequent itemsets from data streams has proved to be very difficult because of computational complexity and the need for real-time response. In this paper, we introduce a novel verification algorithm which we then use to improve the performance of monitoring and mining tasks for association rules. Thus, we propose a frequent itemset mining method for sliding windows, which is faster than the state-of-the-art methods - in fact, its running time that is nearly constant with respect to the window size entails the mining of much larger windows than it was possible before. The performance of other frequent itemset mining methods (including those on static data) can be improved likewise, by replacing their counting methods (e.g., those using hash trees) by our verification algorithm.
Keywords :
data mining; association rules; computational complexity; data streams; frequent mining itemsets; verification algorithm; Association rules; Computational complexity; Computer science; Computerized monitoring; Credit cards; Data mining; Delay; Frequency; Itemsets; Partitioning algorithms;
Conference_Titel :
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location :
Cancun
Print_ISBN :
978-1-4244-1836-7
Electronic_ISBN :
978-1-4244-1837-4
DOI :
10.1109/ICDE.2008.4497426