• DocumentCode
    2077723
  • Title

    Automated discovery of detectors and iteration-performing calculations to recognize patterns in protein sequences using genetic programming

  • Author

    Koza, John R.

  • Author_Institution
    Dept. of Comput. Sci., Stanford Univ., CA, USA
  • fYear
    1994
  • fDate
    21-23 Jun 1994
  • Firstpage
    684
  • Lastpage
    689
  • Abstract
    This paper describes an automated process for the dynamic creation of a pattern-recognizing computer program consisting of initially unknown detectors, an initially-unknown iterative calculation incorporating the as-yet-uncreated detectors, and an initially-unspecified final calculation incorporating the results of the as-yet-uncreated iteration. The program´s goal is to recognize a given protein segment as being a transmembrane domain or non-transmembrane area. The recognizing program to solve this problem will be evolved using the recently developed genetic programming paradigm. Genetic programming starts with a primordial ooze of randomly generated computer programs composed of available programmatic ingredients and then genetically breeds the population using the Darwinian principle of survival of the fittest and the genetic crossover (sexual recombination) operation. Automatic function definition enables genetic programming to dynamically create subroutines (detectors). When cross-validated, the best genetically-evolved recognizer achieves an out-of-sample correlation of 0.968 and an out-of-sample error rate of 1.6%. This error rate is better than that recently reported for five other methods
  • Keywords
    genetic algorithms; medical computing; pattern recognition; proteins; genetic programming; genetically-evolved recognizer; iteration-performing calculations; pattern recognition; protein segment; protein sequences; transmembrane domain; Biomedical computing; Pattern recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94., 1994 IEEE Computer Society Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1063-6919
  • Print_ISBN
    0-8186-5825-8
  • Type

    conf

  • DOI
    10.1109/CVPR.1994.323778
  • Filename
    323778