DocumentCode :
245080
Title :
Fast Algorithms for Frequent Itemset Mining from Uncertain Data
Author :
Leung, Carson Kai-Sang ; MacKinnon, Richard Kyle ; Tanbeer, Syed K.
Author_Institution :
Dept. of Comput. Sci., Univ. of Manitoba, Winnipeg, MB, Canada
fYear :
2014
fDate :
14-17 Dec. 2014
Firstpage :
893
Lastpage :
898
Abstract :
The majority of existing data mining algorithms mine frequent item sets from precise databases. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of the database and mines frequent item sets from the FP-tree. However, there are situations in which data are uncertain. In recent years, researchers have paid attention to frequent item set mining from uncertain databases. UFP-growth is one of the frequently cited algorithms for mining uncertain data. However, the corresponding UFP-tree structure can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose two compact tree structures which capture uncertain data with tighter upper bounds than existing tree structures. We also designed two algorithms that mine frequent item sets from our proposed trees. Our experimental results show the tightness of bounds to expected supports provided by these algorithms.
Keywords :
data handling; data mining; tree data structures; FP-growth; UF-growth algorithm; UF-tree structure; compact FP-tree structure; data mining algorithms; fast algorithms; frequent itemset mining; looser upper bounds; tightened upper bounds; uncertain data handling; Algorithm design and analysis; Data mining; Electron tubes; Integrated circuits; Itemsets; Upper bound; Association analysis; data mining algorithms; frequent patterns; tree structures; uncertain data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
ISSN :
1550-4786
Print_ISBN :
978-1-4799-4303-6
Type :
conf
DOI :
10.1109/ICDM.2014.146
Filename :
7023419
Link To Document :
بازگشت