DocumentCode :
1637872
Title :
Sensible initialization using expert knowledge for genome-wide analysis of epistasis using genetic programming
Author :
Greene, Casey S. ; White, Bill C. ; Moore, Jason H.
Author_Institution :
Dept. of Genetics, Dartmouth Med. Sch., Lebanon, NH
fYear :
2009
Firstpage :
1289
Lastpage :
1296
Abstract :
For biomedical researchers it is now possible to measure large numbers of DNA sequence variations across the human genome. Measuring hundreds of thousands of variations is now routine, but single variations which consistently predict an individual´s risk of common human disease have proven elusive. Instead of single variants determining the risk of common human diseases, it seems more likely that disease risk is best modeled by interactions between biological components. The evolutionary computing challenge now is to effectively explore interactions in these large datasets and identify combinations of variations which are robust predictors of common human diseases such as bladder cancer. One promising approach to this problem is genetic programming (GP). A GP approach for this problem will use darwinian inspired evolution to evolve programs which find and model attribute interactions which predict an individual´s risk of common human diseases. The goal of this study is to develop and evaluate two initializers for this domain. We develop a probabilistic initializer which uses expert knowledge to select attributes and an enumerative initializer which maximizes attribute diversity in the generated population.We compare these initializers to a random initializer which displays no preference for attributes. We show that the expert-knowledge-aware probabilistic initializer significantly outperforms both the random initializer and the enumerative initializer.We discuss implications of these results for the design of GP strategies which are able to detect and characterize predictors of common human diseases.
Keywords :
DNA; diseases; genetic algorithms; genetics; medical expert systems; probability; DNA sequence variations; biomedical research; epistasis; expert knowledge; genetic programming; genome-wide analysis; human disease; human genome; probabilistic initializer; sensible initialization; Bioinformatics; Biological system modeling; Biology computing; Biomedical measurements; DNA; Diseases; Genetic programming; Genomics; Humans; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation, 2009. CEC '09. IEEE Congress on
Conference_Location :
Trondheim
Print_ISBN :
978-1-4244-2958-5
Electronic_ISBN :
978-1-4244-2959-2
Type :
conf
DOI :
10.1109/CEC.2009.4983093
Filename :
4983093
Link To Document :
بازگشت