• DocumentCode
    2663224
  • Title

    Using a clustering genetic algorithm for rule extraction from artificial neural networks

  • Author

    Hruschka, Eduardo R. ; Ebecken, Nelson F F

  • Author_Institution
    Fed. Univ. of Rio de Janeiro, Brazil
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    199
  • Lastpage
    206
  • Abstract
    The main challenge to the use of supervised neural networks in data mining applications is to get explicit knowledge from these models. For this purpose, a study on knowledge acquirement from supervised neural networks employed for classification problems is presented. The methodology is based on the clustering of the hidden units activation values. A clustering genetic algorithm for rule extraction from neural networks is developed. A simple encoding scheme that yields to constant-length chromosomes is used, thus allowing the application of the standard genetic operators. A consistent algorithm to avoid some of the drawbacks of this kind of representation is also developed. In addition, a very simple heuristic is applied to generate the initial population. The individual fitness is determined based on the Euclidean distances among the objects, as well as on the number of objects belonging to each cluster. The developed algorithm is experimentally evaluated in two data mining benchmarks: Iris Plants Database and Pima Indians Diabetes Database. The results are compared with those obtained by the Modified RX Algorithm (E.R. Hruschka and N.F.F. Ebecken, 1999), which is also an algorithm for rule extraction from neural networks
  • Keywords
    data mining; genetic algorithms; heuristic programming; learning (artificial intelligence); neural nets; pattern clustering; Euclidean distances; Iris Plants Database; Modified RX Algorithm; Pima Indians Diabetes Database; artificial neural networks; classification problems; clustering genetic algorithm; consistent algorithm; constant-length chromosomes; data mining applications; data mining benchmarks; encoding scheme; explicit knowledge; hidden units activation values; individual fitness; knowledge acquirement; rule extraction; simple heuristic; standard genetic operators; supervised neural networks; Artificial neural networks; Biological cells; Clustering algorithms; Data mining; Databases; Diabetes; Encoding; Genetic algorithms; Iris; Neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Combinations of Evolutionary Computation and Neural Networks, 2000 IEEE Symposium on
  • Conference_Location
    San Antonio, TX
  • Print_ISBN
    0-7803-6572-0
  • Type

    conf

  • DOI
    10.1109/ECNN.2000.886235
  • Filename
    886235