Title :
Fuzzy association rule mining and classifier with chi-squared correlation measure using genetic network programming
Author :
Taboada, Karla ; Mabu, Shingo ; Gonzales, Eloy ; Shimada, Kaoru ; Hirasawa, Kotaro
Author_Institution :
Grad. Sch. of Inf., Production & Syst., Waseda Univ., Kitakyushu, Japan
Abstract :
One of the most important issues in any association rule mining is the interpretation and evaluation of discovered rules. Thus, most algorithms employ the support-confidence framework for evaluating association and classification rules. Unfortunately, recent studies show that the support and confidence measures are insufficient for filtering out uninteresting association rules, for instance, even strong association rules can be uninteresting and misleading. To deal with this limitation, the support-confidence framework can be supplemented with additional interestingness measures based on statistical significance and correlation analysis. In this paper, a novel fuzzy association rule-based classification approach is proposed, where chi2 is applied as a correlation measure. The algorithm is based on Genetic Network Programming (GNP) and discover comprehensible fuzzy association rules potentially useful for classification. GNP is an evolutionary optimization algorithm that uses directed graph structures as genes instead of strings and trees of Genetic Algorithms (GA) and Genetic Programming (GP), respectively. This feature contributes to creating quite compact programs and implicitly memorizing past action sequences. The proposed model consists of two major phases: 1 generating fuzzy class association rules by using GNP, 2 building a classifier based on the extracted fuzzy rules. In the first phase, chi2 is used for computing the correlation of the rules to be integrated into the classifier. In the second phase, the chi2 value is used as a weight of the rule when calculating the matching degree of the rule with new data. The performance of the proposed algorithm has been compared with other relevant algorithms and the experimental results have shown the advantages and effectiveness of the proposed model.
Keywords :
correlation methods; data mining; directed graphs; fuzzy set theory; genetic algorithms; pattern classification; chi-squared correlation measure; correlation analysis; directed graph structure; discovered rules evaluation; evolutionary optimization algorithm; fuzzy association rule mining; genetic network programming; statistical significance; support confidence framework; Association rules; Data mining; Databases; Economic indicators; Fuzzy sets; Genetic algorithms; Genetic programming; Itemsets; Testing; Tree graphs; association rule mining; classification; fuzzy association rules; fuzzy membership functions; genetic network programming;
Conference_Titel :
ICCAS-SICE, 2009
Conference_Location :
Fukuoka
Print_ISBN :
978-4-907764-34-0
Electronic_ISBN :
978-4-907764-33-3