DocumentCode
2077723
Title
Automated discovery of detectors and iteration-performing calculations to recognize patterns in protein sequences using genetic programming
Author
Koza, John R.
Author_Institution
Dept. of Comput. Sci., Stanford Univ., CA, USA
fYear
1994
fDate
21-23 Jun 1994
Firstpage
684
Lastpage
689
Abstract
This paper describes an automated process for the dynamic creation of a pattern-recognizing computer program consisting of initially unknown detectors, an initially-unknown iterative calculation incorporating the as-yet-uncreated detectors, and an initially-unspecified final calculation incorporating the results of the as-yet-uncreated iteration. The program´s goal is to recognize a given protein segment as being a transmembrane domain or non-transmembrane area. The recognizing program to solve this problem will be evolved using the recently developed genetic programming paradigm. Genetic programming starts with a primordial ooze of randomly generated computer programs composed of available programmatic ingredients and then genetically breeds the population using the Darwinian principle of survival of the fittest and the genetic crossover (sexual recombination) operation. Automatic function definition enables genetic programming to dynamically create subroutines (detectors). When cross-validated, the best genetically-evolved recognizer achieves an out-of-sample correlation of 0.968 and an out-of-sample error rate of 1.6%. This error rate is better than that recently reported for five other methods
Keywords
genetic algorithms; medical computing; pattern recognition; proteins; genetic programming; genetically-evolved recognizer; iteration-performing calculations; pattern recognition; protein segment; protein sequences; transmembrane domain; Biomedical computing; Pattern recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94., 1994 IEEE Computer Society Conference on
Conference_Location
Seattle, WA
ISSN
1063-6919
Print_ISBN
0-8186-5825-8
Type
conf
DOI
10.1109/CVPR.1994.323778
Filename
323778
Link To Document