• DocumentCode
    1637986
  • Title

    A new preprocessing procedure for the haplotype inference problem

  • Author

    Irurozki, Ekhine ; Lozano, José A.

  • Author_Institution
    Intell. Syst. Group, Univ. of the Basque Country, San Sebastian
  • fYear
    2009
  • Firstpage
    1320
  • Lastpage
    1327
  • Abstract
    A haplotype is a DNA sequence that is inherited from one parent. They are especially important in the study of complex diseases since they contain more information than genotype data, so the next high priority phase in human genomics involves the development of a full haplotype map of human genome. However, obtaining haplotype data is technically difficult and expensive. One of the computational methods for obtaining haplotype data from genotype data is the pure parsimony criterion, an approach known as haplotype inference by pure parsimony (HIPP). It has been proved to be an NP-hard problem. We present a new preprocessing method which drastically decreases the number of relevant haplotypes. Several algorithms need to preprocess data; for big problem instances this key procedure is even more important than the process. This preprocessing was eventually tested on real and simulated data applying a tabu search, and the performance of the resulting algorithm showed it to be competitive with the best actual solvers.
  • Keywords
    biocomputing; computational complexity; genomics; optimisation; search problems; DNA sequence; NP-hard problem; complex diseases; genotype data; haplotype inference by pure parsimony; human genomics; preprocessing procedure; tabu search; Bioinformatics; DNA; Discrete event simulation; Diseases; Genomics; Humans; Inference algorithms; NP-hard problem; Sequences; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2009. CEC '09. IEEE Congress on
  • Conference_Location
    Trondheim
  • Print_ISBN
    978-1-4244-2958-5
  • Electronic_ISBN
    978-1-4244-2959-2
  • Type

    conf

  • DOI
    10.1109/CEC.2009.4983097
  • Filename
    4983097