DocumentCode :
1975661
Title :
Mining frequent itemset from uncertain data
Author :
Gao, Feng ; Wu, Chengrong
Author_Institution :
Sch. of Comput. Sci., Fudan Univ., Shanghai, China
fYear :
2011
fDate :
16-18 Sept. 2011
Firstpage :
2329
Lastpage :
2333
Abstract :
We study the problem of mining frequent itemset from probabilistic data. Firstly, to solve the semantic corruption brought by expected frequent itemset conception, we define the probabilistic frequent itemset which is consistent with possible world model and holds the apriori property. Secondly, we develop a dynamic programming like polynomial algorithm for testing candidate frequent itemsets. Finally, a P-Apriori algorithm for mining top-A probabilistic frequent itemsets is presented, which can incrementally report probabilistic frequent itemsets one-by-one in descending order of their confidences. Comprehensive experiments have been conducted on both real and synthetic datasets to verify the effectiveness and efficiency of the algorithm. The results show that P-Apriori algorithm performs stably on various parameter configurations.
Keywords :
data mining; dynamic programming; probability; P-apriori algorithm; apriori property; dynamic programming; frequent itemset mining; probabilistic data; probabilistic frequent itemset; semantic corruption; uncertain data; Algorithm design and analysis; Data mining; Heuristic algorithms; Itemsets; Probabilistic logic; Uncertainty; Apriori; dynamic programming; frequent itemset; probabilistic data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Control Engineering (ICECE), 2011 International Conference on
Conference_Location :
Yichang
Print_ISBN :
978-1-4244-8162-0
Type :
conf
DOI :
10.1109/ICECENG.2011.6057179
Filename :
6057179
Link To Document :
بازگشت