Title :
A maximum likelihood based genetic algorithm for inferring haplotypes from genotypes
Author :
Lakshminarasimhan, P. ; Marmelstein, Robert ; Devito, Mary ; Che, Dongsheng ; Liu, Qi
Author_Institution :
Dept. of Comput. Sci., East Stroudsburg Univ., East Stroudsburg, PA, USA
Abstract :
A haplotype is a set of single nucleotide polymorphisms (SNPs) from a given chromosome, and provides valuable information about complex diseases. Current practices that the inferring of large scale SNP haplotypes from raw SNP data (genotypes) using computational approaches has gained a lot of attention, but it presents a grand challenges as it is inherently a NP-Hard problem. In this paper, we propose a heuristic approach, Genetic Algorithm (GA) model for the haplotypes inference method, based on the maximum-likelihood estimates of haplotype frequencies under the assumption of Hardy-Weinberg proportions. The goal of the genetic algorithm method is to obtain high prediction accuracy within a reasonable computing time. The performance of our model was evaluated on both simulated datasets and real datasets, and these results are promising, indicating that our model is a potential computational tool for haplotype inferences.
Keywords :
cellular biophysics; diseases; genetic algorithms; genetics; maximum likelihood estimation; Hardy- Weinberg proportion; SNP data; genetic algorithm; genotypes; haplotype frequency; haplotypes inference method; maximum likelihood estimates; np hard problem; single nucleotide polymorphism; Accuracy; Bioinformatics; Biological cells; Computational modeling; Diseases; Frequency estimation; Genetic algorithms; Humans; Large-scale systems; Maximum likelihood estimation; Genetic Algorithm; Genotypes; Haplotype Inference; Haplotypes; SNP sites;
Conference_Titel :
Education Technology and Computer (ICETC), 2010 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6367-1
DOI :
10.1109/ICETC.2010.5530004