DocumentCode :
2142921
Title :
Efficient Algorithm for Discovering Potential Interesting Patterns with Closed Itemsets
Author :
Singh, Raj ; Johnsten, Tom ; Raghavan, Vijay ; Xie, Ying
Author_Institution :
Sch. of Sci. & Comp. Eng., Univ. of Houston Clear Lake, Houston, TX, USA
fYear :
2010
fDate :
14-16 Aug. 2010
Firstpage :
414
Lastpage :
419
Abstract :
A pattern discovered from a collection of data is usually considered potentially interesting if its information content can assist the user in their decision making process. To that end, we have defined the potential interestingness of a pattern based on whether it provides statistical knowledge that is able to affect one´s belief system. In previous work, we proposed two novel algorithms, Discovering All Potentially Interesting Patterns (DAPIP) and All-Confidence Discovery of Potentially Interesting Patterns (ACDPIP), designed to discover potentially interesting patterns from a collection of data. Results of experimental investigations show that the application of these two algorithms is limited to non-dense datasets. In response, we propose a new algorithm, referred to as ACDPIP-Closed, designed to discover potential interesting patterns from dense datasets. We show empirically that ACDPIP-Closed is able to effectively and efficiently discover potentially interesting patterns from dense datasets. Additional contributions provided by the paper include a definition of a frequent closed itemset based on an all-confidence threshold and a theorem stating that, under the assumption of a particular ordering of items, an itemset is support based closed if and only if it is all-confidence based closed.
Keywords :
data mining; decision making; pattern classification; ACDPIP; DAPIP; all-confidence discovery of potentially interesting patterns; belief system; closed itemsets; data collection; decision making process; discovering all potentially interesting patterns; frequent closed itemset; information content; nondense datasets; potential interesting patterns; statistical knowledge; Algorithm design and analysis; Association rules; Context; Educational institutions; Itemsets; Noise; Redundancy; Closed itemsets; Data Mining; Interesting Patterns; Positive and Negative rules; assoiciation rules;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Granular Computing (GrC), 2010 IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
978-1-4244-7964-1
Type :
conf
DOI :
10.1109/GrC.2010.55
Filename :
5575950
Link To Document :
بازگشت