DocumentCode :
2130760
Title :
Stream-Close: Fast Mining of Closed Frequent Itemsets in High Speed Data Streams
Author :
Ranganath, B.N. ; Murty, M. Narasimha
Author_Institution :
Stochastic Syst. Lab., Indian Inst. of Sci., Bangalore
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
516
Lastpage :
525
Abstract :
With the emergence of large-volume and high-speed streaming data, the recent techniques for stream mining of CFIpsilas (closed frequent itemsets) will become inefficient. When concept drift occurs at a slow rate in high speed data streams, the rate of change of information across different sliding windows will be negligible. So, the user wonpsilat be devoid of change in information if we slide window by multiple transactions at a time. Therefore, we propose a novel approach for mining CFIpsilas cumulatively by making sliding width(ges1) over high speed data streams. However, it is nontrivial to mine CFIpsilas cumulatively over stream, because such growth may lead to the generation of exponential number of candidates for closure checking. In this study, we develop an efficient algorithm, stream-close, for mining CFIpsilas over stream by exploring some interesting properties. Our performance study reveals that stream-close achieves good scalability and has promising results.
Keywords :
data mining; closed frequent itemset mining; closed frequent itemsets; high speed data streams; Association rules; Conferences; Data mining; History; Itemsets; Scalability; Stochastic systems; Telephony; Transaction databases; Web pages; Association rules; CFI´s; Data stream;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3503-6
Electronic_ISBN :
978-0-7695-3503-6
Type :
conf
DOI :
10.1109/ICDMW.2008.51
Filename :
4733975
Link To Document :
بازگشت