DocumentCode
504201
Title
Fuzzy association rule mining and classifier with chi-squared correlation measure using genetic network programming
Author
Taboada, Karla ; Mabu, Shingo ; Gonzales, Eloy ; Shimada, Kaoru ; Hirasawa, Kotaro
Author_Institution
Grad. Sch. of Inf., Production & Syst., Waseda Univ., Kitakyushu, Japan
fYear
2009
fDate
18-21 Aug. 2009
Firstpage
3863
Lastpage
3869
Abstract
One of the most important issues in any association rule mining is the interpretation and evaluation of discovered rules. Thus, most algorithms employ the support-confidence framework for evaluating association and classification rules. Unfortunately, recent studies show that the support and confidence measures are insufficient for filtering out uninteresting association rules, for instance, even strong association rules can be uninteresting and misleading. To deal with this limitation, the support-confidence framework can be supplemented with additional interestingness measures based on statistical significance and correlation analysis. In this paper, a novel fuzzy association rule-based classification approach is proposed, where chi2 is applied as a correlation measure. The algorithm is based on Genetic Network Programming (GNP) and discover comprehensible fuzzy association rules potentially useful for classification. GNP is an evolutionary optimization algorithm that uses directed graph structures as genes instead of strings and trees of Genetic Algorithms (GA) and Genetic Programming (GP), respectively. This feature contributes to creating quite compact programs and implicitly memorizing past action sequences. The proposed model consists of two major phases: 1 generating fuzzy class association rules by using GNP, 2 building a classifier based on the extracted fuzzy rules. In the first phase, chi2 is used for computing the correlation of the rules to be integrated into the classifier. In the second phase, the chi2 value is used as a weight of the rule when calculating the matching degree of the rule with new data. The performance of the proposed algorithm has been compared with other relevant algorithms and the experimental results have shown the advantages and effectiveness of the proposed model.
Keywords
correlation methods; data mining; directed graphs; fuzzy set theory; genetic algorithms; pattern classification; chi-squared correlation measure; correlation analysis; directed graph structure; discovered rules evaluation; evolutionary optimization algorithm; fuzzy association rule mining; genetic network programming; statistical significance; support confidence framework; Association rules; Data mining; Databases; Economic indicators; Fuzzy sets; Genetic algorithms; Genetic programming; Itemsets; Testing; Tree graphs; association rule mining; classification; fuzzy association rules; fuzzy membership functions; genetic network programming;
fLanguage
English
Publisher
ieee
Conference_Titel
ICCAS-SICE, 2009
Conference_Location
Fukuoka
Print_ISBN
978-4-907764-34-0
Electronic_ISBN
978-4-907764-33-3
Type
conf
Filename
5332929
Link To Document