Title :
Mining Frequent Itemsets over Data Streams with Multiple Time-Sensitive Sliding Windows
Author :
Jin, Long ; Chai, Duck Jin ; Lee, Yang Koo ; Ryu, Keun Ho
Abstract :
A data stream is a massive unbounded sequence of data elements continuously generated at a rapid rate. Consequently, the knowledge embedded in a data stream is more likely to be changed as time goes by. Frequent pattern is a kind of data mining techniques discovered knowledge and has been widely studied over the last decade. There are several models and approaches for mining such knowledge, but all previous works only consider a static length of sliding window for mining frequent itemsets. We propose a multiple slidng windows for mining frequent patterns on data stream in this paper. The details of study scope are as follows. We propose an efficient discounting method with different lengths of time-sensitive sliding-window. This discounting method doesn´t lose the information about Acount and also saves much memory space. Finally, we implement and evaluate the proposed algorithms for mining frequent itemsets on data stream.
Keywords :
Bioinformatics; Data analysis; Data mining; Data structures; Databases; Feeds; Information technology; Itemsets; Laboratories; Telecommunication traffic; Data MiningKnowledge Discovery;
Conference_Titel :
Advanced Language Processing and Web Information Technology, 2007. ALPIT 2007. Sixth International Conference on
Conference_Location :
Luoyang, Henan, China
Print_ISBN :
978-0-7695-2930-1
DOI :
10.1109/ALPIT.2007.39