DocumentCode :
3108367
Title :
Concise representations for approximate association rules
Author :
Xu, Yue ; Li, Yuefeng ; Shaw, Gavin
Author_Institution :
Fac. of Inf. Technol., Queensland Univ. of Technol., Brisbane, QLD
fYear :
2008
fDate :
12-15 Oct. 2008
Firstpage :
94
Lastpage :
101
Abstract :
The quality of association rule mining has drawn more and more attention recently. One problem with the quality of the discovered association rules is the huge size of the extracted rule set. Often for a dataset, a huge number of rules can be extracted, but many of them can be redundant to other rules and thus useless in practice. Mining non-redundant rules is a promising approach to solve this problem. In this paper, we firstly propose a definition for redundancy; then we propose a concise representation called reliable basis for representing non-redundant association rules for both exact rules and approximate rules. We prove that the redundancy elimination based on the reliable basis does not reduce the belief to the extracted rules. We also prove that all association rules can be deduced from the reliable basis. Therefore the reliable basis is a lossless representation of association rules. Experimental results show that the reliable basis significantly reduces the number of extracted rules.
Keywords :
data mining; knowledge representation; approximate association rules; association rule mining; concise representations; nonredundant association rules; reliable basis; Association rules; Australia; Data analysis; Data mining; Information retrieval; Information technology; Itemsets; Redundancy; Association rule mining; certainty factor; closed itemsets; generator; redundant association rules;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location :
Singapore
ISSN :
1062-922X
Print_ISBN :
978-1-4244-2383-5
Electronic_ISBN :
1062-922X
Type :
conf
DOI :
10.1109/ICSMC.2008.4811257
Filename :
4811257
Link To Document :
بازگشت