• DocumentCode
    2221649
  • Title

    Change-point detection in biological sequences via genetic algorithm

  • Author

    Polushina, Tatiana ; Sofronov, Georgy

  • Author_Institution
    Dept. of Math., Mari State Univ., Yoshkar-Ola, Russia
  • fYear
    2011
  • fDate
    5-8 June 2011
  • Firstpage
    1966
  • Lastpage
    1971
  • Abstract
    Genome research is one of the most interesting and important areas of the science nowadays. It is well-known that the genomes of complex organisms are highly organized. Many studies show that DNA sequence can be divided into a few segments, which have various properties of interest. Detection of this segments is extremely significant from the point of view of practical applications, as well as for understanding evolutional processes. We model genome sequences as a multiple change-point process, that is, a process in which sequential data are divided into segments by an unknown number of change-points, with each segment supposed to have been generated by a process with different parameters. Multiple change-point models are important in many biological applications and, specifically, in analysis of biomolecular sequences. In this paper, we propose to use genetic algorithm to identify change-points. Numerical experiments illustrate the effectiveness of our approach to the problem. We obtain estimates for the positions of change-points in artificially generated sequences and compare the accuracy of these estimates to those obtained via Markov chain Monte Carlo and the Cross-Entropy method. We also provide examples with real data sets to illustrate the usefulness of our method.
  • Keywords
    DNA; Markov processes; Monte Carlo methods; cellular biophysics; genetic algorithms; genomics; molecular biophysics; molecular configurations; DNA sequence; Markov chain Monte Carlo method; biological sequences; biomolecular sequences; change-point detection; complex organisms; cross-entropy method; evolutional processes; genetic algorithm; genome sequences; Biological cells; Biological system modeling; DNA; Genetic algorithms; Genomics; Monte Carlo methods; change-point problem; combinatorial optimization; comparative genomics; genetic algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation (CEC), 2011 IEEE Congress on
  • Conference_Location
    New Orleans, LA
  • ISSN
    Pending
  • Print_ISBN
    978-1-4244-7834-7
  • Type

    conf

  • DOI
    10.1109/CEC.2011.5949856
  • Filename
    5949856