Regional Information Center for Science and Technology (RICEST) - Parallel frequent set mining using inverted matrix approach

DocumentCode :

1877114

Title :

Parallel frequent set mining using inverted matrix approach

Author :

Bhanderi, S.D. ; Garg, Shelly

Author_Institution :

Dept. of Comput. Eng., A.D. Patel Inst. of Technol., Vallabh Vidya Nagar, India

fYear :

2012

fDate :

6-8 Dec. 2012

Firstpage :

Lastpage :

Abstract :

Mining frequent patterns in large transactional databases is considered one of the most important data mining problems. The recent explosive growth in data collection has left current rule mining algorithms restricted and insufficient for analyzing excessively large transaction sets, because they suffer from many problems when mining massive transaction datasets. Some of the major problems are: (1) the need for multiple database scans, (2) massive computational power requirements, (3) huge memory requirements, (4) lack of parallelism, and (5) limited interactivity across different support values ([1-2]). In this paper, the inverted matrix, a new representation of the transactional database, is used and distributed among parallel nodes. Frequent items from the inverted matrix are assigned to the parallel nodes. In the parallel implementation, each parallel node generates a Co-Occurrence Frequent Item (COFI) tree for the frequent items assigned to it. The mining process is carried out by all nodes, each of which generates all frequent itemsets in which its assigned items participate. Little communication is required between the master node and the parallel nodes to generate all frequent itemsets. Two techniques are used to assign frequent items to the parallel nodes: (1) Alternate Loop Splitting (ALS) and (2) Block Loop Splitting (BLS). We implemented sequential as well as parallel algorithms for frequent set mining and compared their performance on the mushroom [9] database, which has approximately 10,000 transactions, 120 distinct items, and an average transaction size of 23. Alternate loop splitting was found to achieve better time complexity than block loop splitting. Both parallel techniques were also found to perform better than the sequential algorithm.
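
The two assignment techniques named in the abstract can be illustrated with a minimal Python sketch. The abstract does not spell out the exact scheme, so this assumes the usual meanings of the terms: alternate loop splitting hands out items cyclically (round-robin), and block loop splitting gives each node one contiguous block. The function names, the item labels, and the node count below are hypothetical and are not taken from the paper.

```python
def alternate_loop_split(items, num_nodes):
    """Assumed ALS: cyclic assignment, item i goes to node i % num_nodes."""
    return [items[node::num_nodes] for node in range(num_nodes)]


def block_loop_split(items, num_nodes):
    """Assumed BLS: each node receives one contiguous block of items."""
    base, extra = divmod(len(items), num_nodes)
    blocks, start = [], 0
    for node in range(num_nodes):
        # The first `extra` nodes take one additional item each.
        size = base + (1 if node < extra else 0)
        blocks.append(items[start:start + size])
        start += size
    return blocks


# Hypothetical frequent items, e.g. in the order they appear in the inverted matrix.
frequent_items = ["A", "B", "C", "D", "E", "F", "G"]

als = alternate_loop_split(frequent_items, 3)
bls = block_loop_split(frequent_items, 3)

print(als)  # [['A', 'D', 'G'], ['B', 'E'], ['C', 'F']]
print(bls)  # [['A', 'B', 'C'], ['D', 'E'], ['F', 'G']]
```

Under this reading, each node would then build COFI trees only for the items in its own list. When the per-item mining cost rises or falls steadily along the item order, a cyclic split tends to spread expensive items across nodes more evenly than a block split; whether that explains the reported result is not stated in the abstract.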

Keywords :

data mining; database management systems; matrix algebra; parallel algorithms; set theory; ALS; BLS; COFI tree; alternate loop splitting; block loop splitting; cooccurrence frequent item tree; data collection; data mining problems; inverted matrix approach; parallel frequent set mining; parallel nodes; rule mining algorithms; transactional database; frequent itemset; inverted matrix; parallel data mining

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Engineering (NUiCONE), 2012 Nirma University International Conference on

Conference_Location :

Ahmedabad

Print_ISBN :

978-1-4673-1720-7

Type :

conf

DOI :

10.1109/NUICONE.2012.6493178

Filename :

6493178

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1877114