Title :
Improved Association Rule Mining by Modified Trimming
Author :
Hwang, Wontae ; Kim, Dongseung
Author_Institution :
Korea University, Korea
Abstract :
This paper presents a new association mining algorithm that uses two phase sampling for shortening the execution time at the cost of precision of the mining result. Previous FAST (Finding Association by Sampling Technique) algorithm has the weakness in that it only considered the frequent 1-itemsets in trimming/growing, thus, it did not have ways of considering mulit-itemsets including 2-itemsets. The new algorithm reflects the multi-itemsets in sampling transactions. It improves the mining results by adjusting the counts of both missing itemsets and false itemsets. Experimentally on a representative synthetic database, the accuracy of 2-itemsets reaches 0.68 compared to 0.46 while it maintains the same quality.
Keywords :
Algorithm design and analysis; Association rules; Costs; Data analysis; Data mining; Data structures; Information technology; Itemsets; Sampling methods; Transaction databases;
Conference_Titel :
Computer and Information Technology, 2006. CIT '06. The Sixth IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
0-7695-2687-X
DOI :
10.1109/CIT.2006.101