DocumentCode :
2993660
Title :
A Simple But Effective Stream Maximal Frequent Itemset Mining Algorithm
Author :
Li, Haifeng ; Zhang, Ning
Author_Institution :
Sch. of Inf., Central Univ. of Finance & Econ., Beijing, China
fYear :
2011
fDate :
3-4 Dec. 2011
Firstpage :
1268
Lastpage :
1272
Abstract :
Maximal frequent item sets are one of several condensed representations of frequent item sets, which store most of the information contained in frequent item sets using less space, thus being more suitable for stream mining. This paper focuses on mining maximal frequent item sets approximately over a stream landmark model. We separate the continuously arriving transactions into sections, and the mining results are indexed by an extended direct update tree, thus, a simple but effective algorithm named SMIS is proposed. In our algorithm, we employ the Chern off Bound to perform the maximal frequent item set mining in a false negative manner, which can reduce the memory cost, as well guarantee our algorithm conducting with an incremental fashion. Our experimental results on two synthetic datasets and two real world datasets show that SMIS achieves much reduced memory cost in comparison with the state-of-the-art algorithm with a 100 percent precision.
Keywords :
data mining; set theory; SMIS; chern off bound; stream landmark model; stream maximal frequent itemset mining algorithm; Approximation algorithms; Approximation methods; Data mining; Educational institutions; Indexes; Itemsets; false negative; maximal frequent itemset; stream;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Security (CIS), 2011 Seventh International Conference on
Conference_Location :
Hainan
Print_ISBN :
978-1-4577-2008-6
Type :
conf
DOI :
10.1109/CIS.2011.281
Filename :
6128440
Link To Document :
بازگشت