DocumentCode :
1048074
Title :
A support-ordered trie for fast frequent itemset discovery
Author :
Woon, Yew-Kwong ; Ng, Wee-Keong ; Lim, Ee-Peng
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
Volume :
16
Issue :
7
fYear :
2004
fDate :
7/1/2004 12:00:00 AM
Firstpage :
875
Lastpage :
879
Abstract :
The importance of data mining is apparent with the advent of powerful data collection and storage tools; raw data is so abundant that manual analysis is no longer possible. Unfortunately, data mining problems are difficult to solve and this prompted the introduction of several novel data structures to improve mining efficiency. Here, we critically examine existing preprocessing data structures used in association rule mining for enhancing performance in an attempt to understand their strengths and weaknesses. Our analyses culminate in a practical structure called the SOTrielT (support-ordered trie itemset) and two synergistic algorithms to accompany it for the fast discovery of frequent itemsets. Experiments involving a wide range of synthetic data sets reveal that its algorithms outperform FP-growth, a recent association rule mining algorithm with excellent performance, by up to two orders of magnitude and, thus, verifying its´ efficiency and viability.
Keywords :
data mining; tree data structures; association rule mining; data mining; data structures; fast frequent itemset discovery; support-ordered trie; Algorithm design and analysis; Association rules; Data analysis; Data mining; Data structures; Inspection; Internet; Itemsets; Remote sensing; Transaction databases;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2004.1318569
Filename :
1318569
Link To Document :
بازگشت