DocumentCode :
2650857
Title :
Mining Non-derivable Hypercliques
Author :
Koufakou, Anna ; Ragothaman, Pradeep
Author_Institution :
U.A. Whitaker Coll. of Eng., Florida Gulf Coast Univ., Fort Myers, FL, USA
fYear :
2011
fDate :
7-9 Nov. 2011
Firstpage :
489
Lastpage :
496
Abstract :
Hyper cliques have been successfully applied in a number of applications, e.g. clustering and noise removal. A hyper clique is an item set containing items that are strongly correlated with each other. Even though hyper cliques have been shown to handle datasets with skewed support distribution and low support threshold well, they might still face problems for dense datasets and lower h-confidence threshold. In this paper, we propose a new pruning method based on combining hyper cliques and Non-Derivable Item sets (NDIs) in order to substantially reduce the amount of generated hyper clique sets. Specifically, we propose a new collection of hyper cliques, called Non-Derivable Hyper cliques (NDHC), and present an efficient algorithm to mine these sets, called NDHC Miner. The proposed NDHC collection is a loss less representation of hyper cliques, i.e., given the item sets in NDHC, we can generate the complete collection of hyper cliques and their support, without additional scanning of the dataset. We experimentally compare NDHC with Hyper cliques (HC), as well as another condensed representation of hyper cliques, maximal hyper cliques (MHP). Our experiments show that the NDHC collection offers substantial advantages over HC, and even MHP, especially for dense datasets and lower h-confidence threshold values.
Keywords :
data handling; data mining; dataset handling; frequent itemset mining; low support threshold; lower h-confidence threshold values; maximal hyper cliques; nonderivable hyperclique mining; nonderivable item sets; skewed support distribution; Association rules; Clustering algorithms; Face; Itemsets; Noise; Runtime; dense data; frequent itemset mining; hyperclique; non-derivable itemset; skewed support distribution;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location :
Boca Raton, FL
ISSN :
1082-3409
Print_ISBN :
978-1-4577-2068-0
Electronic_ISBN :
1082-3409
Type :
conf
DOI :
10.1109/ICTAI.2011.80
Filename :
6103370
Link To Document :
بازگشت