DocumentCode
456797
Title
Frequent Itemset Mining Based on Heuristic Two Level Counting
Author
Liu, Feng ; Tian, Fengzhan ; Zhu, Qiliang
Author_Institution
Dept. of Comput. Sci. & Technol., Beijing Univ. of Posts & Telecommun.
Volume
2
fYear
2006
fDate
Aug. 30 2006-Sept. 1 2006
Firstpage
640
Lastpage
643
Abstract
Recently, many enchanced Apriori algorithms have been proposed to efficiently generate all frequent itemsets from datasets in data mining field. Although efficient techniques were presented, those algorithms are either time-consuming or memory-consuming. To address the issue further, a new algorithm, which introduced a novel support counting method, heuristic two level counting, is proposed. HTLC method adopts an improved itemset generating technology in the generation process of low level itemsets, which promotes the production of low level frequent itemsets or candidate itemsets. It also applies a heuristic traversal technology which speeds up one pass over datasets and support counting technology which largely reduces the number of passes over datasets to the generation of high level frequent itemsets. Finally, the experimental results show that it outperforms existing Apriori-like algorithms in mostly datasets
Keywords
data mining; data structures; database management systems; Apriori algorithm; candidate itemsets; frequent itemset mining; heuristic two level counting; Computer science; Data mining; Data structures; Itemsets; Production; Tree graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovative Computing, Information and Control, 2006. ICICIC '06. First International Conference on
Conference_Location
Beijing
Print_ISBN
0-7695-2616-0
Type
conf
DOI
10.1109/ICICIC.2006.285
Filename
1692068
Link To Document