DocumentCode
2113693
Title
An Improved Apriori Algorithm Based on the Matrix
Author
Wang, Feng ; Li, Yong-hua
Author_Institution
Sch. of Comput. Sci. & Technol., Wuhan Univ. of Technol., Wuhan
fYear
2008
fDate
18-18 Dec. 2008
Firstpage
152
Lastpage
155
Abstract
A priori algorithm is a classical algorithm of association rule mining and also is one of the most important algorithms. But it also has some limitations. It produces overfull candidates of frequent itemsets, so the algorithm needs scan database frequently when finding frequent itemsets. So it must be inefficient. To solve the bottleneck of the a priori algorithm, this paper introduces an improved algorithm based on the matrix. It uses the matrix effectively indicate the affairs in the database and uses the "AND operation" to deal with the matrix to produce the largest frequent itemsets and others. It needn\´t scan the database time and again to lookup the affairs, and also greatly reduce the number of candidates of frequent itemsets. This paper uses an example to analyze and compare the difference between the two algorithms and the result shows that the improved algorithm obtains the bonus time of calculating and promotes the efficiency of computing.
Keywords
data mining; matrix algebra; AND operation; a priori algorithm; association rule mining; database; Algorithm design and analysis; Association rules; Biomedical engineering; Computer science; Data mining; Databases; Electronic mail; Itemsets; Seminars; AND operation; Apriori algorithm; affair; frequent itemsets; matrix;
fLanguage
English
Publisher
ieee
Conference_Titel
Future BioMedical Information Engineering, 2008. FBIE '08. International Seminar on
Conference_Location
Wuhan, Hubei
Print_ISBN
978-0-7695-3561-6
Type
conf
DOI
10.1109/FBIE.2008.80
Filename
5076707
Link To Document