DocumentCode :
2077723
Title :
Automated discovery of detectors and iteration-performing calculations to recognize patterns in protein sequences using genetic programming
Author :
Koza, John R.
Author_Institution :
Dept. of Comput. Sci., Stanford Univ., CA, USA
fYear :
1994
fDate :
21-23 Jun 1994
Firstpage :
684
Lastpage :
689
Abstract :
This paper describes an automated process for the dynamic creation of a pattern-recognizing computer program consisting of initially unknown detectors, an initially-unknown iterative calculation incorporating the as-yet-uncreated detectors, and an initially-unspecified final calculation incorporating the results of the as-yet-uncreated iteration. The program´s goal is to recognize a given protein segment as being a transmembrane domain or non-transmembrane area. The recognizing program to solve this problem will be evolved using the recently developed genetic programming paradigm. Genetic programming starts with a primordial ooze of randomly generated computer programs composed of available programmatic ingredients and then genetically breeds the population using the Darwinian principle of survival of the fittest and the genetic crossover (sexual recombination) operation. Automatic function definition enables genetic programming to dynamically create subroutines (detectors). When cross-validated, the best genetically-evolved recognizer achieves an out-of-sample correlation of 0.968 and an out-of-sample error rate of 1.6%. This error rate is better than that recently reported for five other methods
Keywords :
genetic algorithms; medical computing; pattern recognition; proteins; genetic programming; genetically-evolved recognizer; iteration-performing calculations; pattern recognition; protein segment; protein sequences; transmembrane domain; Biomedical computing; Pattern recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition, 1994. Proceedings CVPR '94., 1994 IEEE Computer Society Conference on
Conference_Location :
Seattle, WA
ISSN :
1063-6919
Print_ISBN :
0-8186-5825-8
Type :
conf
DOI :
10.1109/CVPR.1994.323778
Filename :
323778
Link To Document :
بازگشت