DocumentCode :
234700
Title :
Proposed algorithm for frequent item set generation
Author :
Singh, Ashutosh ; Agarwal, Jatin
Author_Institution :
ASET, Amity Univ. Uttar Pradesh, Noida, India
fYear :
2014
fDate :
7-9 Aug. 2014
Firstpage :
160
Lastpage :
165
Abstract :
Data mining is an efficient technology to discover patterns in large databases. Association rule mining techniques are used to find the correlation between the various item sets in the database, and this correlation between various item sets are used in decision making and pattern analysis. In recent years the problem of finding frequent items and association rules from large datasets has been proposed by many researchers. Various research papers on association rule mining (ARM) are studied and analyzed first to understand the existing algorithms. The Apriori algorithm is the basic ARM algorithm, but it requires so many database scans to find frequent items. In Dynamic Item set counting (DIC) algorithm less number of database scans are needed, but complex data structure lattice is used. The main focus of this paper is to propose a new optimized algorithm (FI-generator) and to compare its performance with the existing algorithms. A secondary data set is used to find out frequent item sets and association rules with the help of existing and proposed algorithm). We observed that the proposed algorithm find out the frequent item sets and association rules from databases as compared to the existing algorithms in less numbers of database scans. In the proposed algorithm an optimized data structure adjacency matrix is used. Proposed algorithm reduces the size of candidate-K item set in successive iteration. Pruning is also done at two stages which reduces the memory space.
Keywords :
data mining; data structures; database management systems; decision making; ARM algorithm; Apriori algorithm; DIC algorithm; FI-generator algorithm; association rule mining techniques; candidate-K item set; complex data structure lattice; data mining; database scans; decision making; dynamic item set counting algorithm; frequent item set generation; optimized data structure adjacency matrix; pattern analysis; pattern discovery; Association rules; Heuristic algorithms; Partitioning algorithms; Solids; Transaction databases; Adjacency matrix; Association rules; Data mining; FI-generator; Frequent item sets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Contemporary Computing (IC3), 2014 Seventh International Conference on
Conference_Location :
Noida
Print_ISBN :
978-1-4799-5172-7
Type :
conf
DOI :
10.1109/IC3.2014.6897166
Filename :
6897166
Link To Document :
بازگشت