DocumentCode
2663224
Title
Using a clustering genetic algorithm for rule extraction from artificial neural networks
Author
Hruschka, Eduardo R. ; Ebecken, Nelson F F
Author_Institution
Fed. Univ. of Rio de Janeiro, Brazil
fYear
2000
fDate
2000
Firstpage
199
Lastpage
206
Abstract
The main challenge to the use of supervised neural networks in data mining applications is to get explicit knowledge from these models. For this purpose, a study on knowledge acquirement from supervised neural networks employed for classification problems is presented. The methodology is based on the clustering of the hidden units activation values. A clustering genetic algorithm for rule extraction from neural networks is developed. A simple encoding scheme that yields to constant-length chromosomes is used, thus allowing the application of the standard genetic operators. A consistent algorithm to avoid some of the drawbacks of this kind of representation is also developed. In addition, a very simple heuristic is applied to generate the initial population. The individual fitness is determined based on the Euclidean distances among the objects, as well as on the number of objects belonging to each cluster. The developed algorithm is experimentally evaluated in two data mining benchmarks: Iris Plants Database and Pima Indians Diabetes Database. The results are compared with those obtained by the Modified RX Algorithm (E.R. Hruschka and N.F.F. Ebecken, 1999), which is also an algorithm for rule extraction from neural networks
Keywords
data mining; genetic algorithms; heuristic programming; learning (artificial intelligence); neural nets; pattern clustering; Euclidean distances; Iris Plants Database; Modified RX Algorithm; Pima Indians Diabetes Database; artificial neural networks; classification problems; clustering genetic algorithm; consistent algorithm; constant-length chromosomes; data mining applications; data mining benchmarks; encoding scheme; explicit knowledge; hidden units activation values; individual fitness; knowledge acquirement; rule extraction; simple heuristic; standard genetic operators; supervised neural networks; Artificial neural networks; Biological cells; Clustering algorithms; Data mining; Databases; Diabetes; Encoding; Genetic algorithms; Iris; Neural networks;
fLanguage
English
Publisher
ieee
Conference_Titel
Combinations of Evolutionary Computation and Neural Networks, 2000 IEEE Symposium on
Conference_Location
San Antonio, TX
Print_ISBN
0-7803-6572-0
Type
conf
DOI
10.1109/ECNN.2000.886235
Filename
886235
Link To Document