• DocumentCode
    2690312
  • Title

    Genetic programming - a tool for flexible rule extraction

  • Author

    König, R. ; Johansson, U. ; Niklasson, L.

  • Author_Institution
    Univ. of Boras, Boras
  • fYear
    2007
  • fDate
    25-28 Sept. 2007
  • Firstpage
    1304
  • Lastpage
    1310
  • Abstract
    Although data mining is performed to support decision making, many of the most powerful techniques, like neural networks and ensembles, produce opaque models. This lack of interpretability is an obvious disadvantage, since decision makers normally require some sort of explanation before taking action. To achieve comprehensibility, accuracy is often sacrificed by the use of simpler, transparent models, such as decision trees. Another alternative is rule extraction; i.e. to transform the opaque model into a comprehensible model, keeping acceptable accuracy. We have previously suggested a rule extraction algorithm named G-REX, which is based on genetic programming. One key property of G-REX, due to the use of genetic programming, is the possibility to use different representation languages. In this study we apply G-REX to estimation tasks. More specifically, three representation languages are evaluated using eight publicly available data sets. The quality of the extracted rules is compared to two standard techniques producing comprehensible models; multiple linear regression and the decision tree algorithm C&RT. The results show that G-REX outperforms the standard techniques, but that the choice of representation language is important.
  • Keywords
    data mining; decision making; decision trees; estimation theory; genetic algorithms; knowledge representation languages; neural nets; regression analysis; C&RT algorithm; G-REX; comprehensible model; data mining; decision making; decision trees; ensembles; estimation tasks; flexible rule extraction; genetic programming; multiple linear regression; neural networks; opaque models; representation languages; transparent models; Data mining; Decision making; Decision trees; Genetic programming; Linear regression; Neural networks; Parameter estimation; Predictive models; Robustness; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
  • Conference_Location
    Singapore
  • Print_ISBN
    978-1-4244-1339-3
  • Electronic_ISBN
    978-1-4244-1340-9
  • Type

    conf

  • DOI
    10.1109/CEC.2007.4424621
  • Filename
    4424621