• DocumentCode
    1078761
  • Title

    Natural Encoding for Evolutionary Supervised Learning

  • Author

    Aguilar-Ruiz, Jesús S. ; Giráldez, Raúl ; Riquelme, José C.

  • Author_Institution
    Pablo de Olavide Univ., Seville
  • Volume
    11
  • Issue
    4
  • fYear
    2007
  • Firstpage
    466
  • Lastpage
    479
  • Abstract
    Some of the most influential factors in the quality of the solutions found by an evolutionary algorithm (EA) are a correct coding of the search space and an appropriate evaluation function of the potential solutions. EAs are often used to learn decision rules from datasets, which are encoded as individuals in the genetic population. In this paper, the coding of the search space for the obtaining of those decision rules is approached, i.e., the representation of the individuals of the genetic population and also the design of specific genetic operators. Our approach, called "natural coding," uses one gene per feature in the dataset (continuous or discrete). The examples from the datasets are also encoded into the search space, where the genetic population evolves, and therefore the evaluation process is improved substantially. Genetic operators for the natural coding are formally defined as algebraic expressions. Experiments with several datasets from the University of California at Irvine (UCI) machine learning repository show that as the genetic operators are better guided through the search space, the number of rules decreases considerably while maintaining the accuracy, similar to that of hybrid coding, which joins the well-known binary and real representations to encode discrete and continuous attributes, respectively. The computational cost associated with the natural coding is also reduced with regard to the hybrid representation. Our algorithm, HIDER*, has been statistically tested against C4.5 and C4.5 Rules, and performed well. The knowledge models obtained are simpler, with very few decision rules, and therefore easier to understand, which is an advantage in many domains. The experiments with high-dimensional datasets showed the same good behavior, maintaining the quality of the knowledge model with respect to prediction accuracy.
  • Keywords
    genetic algorithms; knowledge representation; learning (artificial intelligence); search problems; HIDER* algorithm; algebraic expressions; continuous attributes; decision rule learning; discrete attributes; evolutionary algorithm; evolutionary supervised learning; genetic operators; genetic population; machine learning; natural coding; natural encoding; search space coding; Accuracy; Computational efficiency; Encoding; Evolutionary computation; Genetics; Machine learning; Performance evaluation; Predictive models; Supervised learning; Testing; Decision rules; evolutionary encoding; supervised learning;
  • fLanguage
    English
  • Journal_Title
    Evolutionary Computation, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1089-778X
  • Type

    jour

  • DOI
    10.1109/TEVC.2006.883466
  • Filename
    4280854