مرکز منطقه ای اطلاع رساني علوم و فناوري - A pattern decomposition (PD) algorithm for finding all frequent patterns in large datasets

DocumentCode :

2336146

Title :

A pattern decomposition (PD) algorithm for finding all frequent patterns in large datasets

Author :

Zou, Qinghua ; Chu, Wesley ; Johnson, David ; Chiu, Henry

Author_Institution :

Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA

fYear :

2001

fDate :

2001

Firstpage :

673

Lastpage :

674

Abstract :

Efficient algorithms to mine frequent patterns are crucial to many tasks in data mining. Since the Apriori algorithm was proposed (R. Agrawal and R. Srikant, 1994), there have been several methods proposed to improve its performance. However, most still adopt its candidate set generation-and-test approach. We propose a pattern decomposition (PD) algorithm that can significantly reduce the size of the dataset on each pass, making it more efficient to mine frequent patterns in a large dataset. The proposed algorithm avoids the costly process of candidate set generation and saves time by reducing dataset. Our empirical evaluation shows that the algorithm outperforms Apriori by one order of magnitude and is faster than FP-tree. Further, PD is more scalable than both Apriori and FP-tree

Keywords :

data mining; pattern recognition; set theory; very large databases; Apriori algorithm; FP-tree; candidate set generation; candidate set generation-and-test approach; data mining; frequent pattern mining; large datasets; pattern decomposition algorithm; Association rules; Computer science; Data mining; Itemsets;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on

Conference_Location :

San Jose, CA

Print_ISBN :

0-7695-1119-8

Type :

conf

DOI :

10.1109/ICDM.2001.989603

Filename :

989603

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2336146