• DocumentCode
    3230537
  • Title

    FMGA: finding motifs by genetic algorithm

  • Author

    Liu, Falcon F M ; Tsai, Jeffrey J P ; Chen, R.M. ; Chen, S.N. ; Shih, S.H.

  • Author_Institution
    Inst. of Bioinformatics, Taichung Healthcare & Manage. Univ., Taiwan
  • fYear
    2004
  • fDate
    19-21 May 2004
  • Firstpage
    459
  • Lastpage
    466
  • Abstract
    In the era of post-genomics, almost all the genes have been sequenced and enormous amounts of data have been generated. Hence, to mine useful information from these data is a very important topic. In this paper we propose a new approach for finding potential motifs in the regions located from the -2000 bp upstream to +1000 bp downstream of transcription start site (TSS). This new approach is developed based on the genetic algorithm (GA). The mutation in the GA is performed by using position weight matrices to reserve the completely conserved positions. The crossover is implemented with special-designed gap penalties to produce the optimal child pattern. We also present a rearrangement method based on position weight matrices to avoid the presence of a very stable local minimum, which may make it quite difficult for the other operators to generate the optimal pattern. Our approach shows superior results by comparing with multiple em for motif elicitation (MEME) and Gibbs sampler, which are two popular algorithms for finding motifs.
  • Keywords
    biology computing; data mining; genetic algorithms; genetics; pattern recognition; sequences; FMGA; data mining; gene sequences; genetic algorithm; motifs; optimal child pattern; position weight matrices; transcription start site; Artificial neural networks; Bioinformatics; DNA; Genetic algorithms; Medical services; Polymers; Proteins; RNA; Sequences; Technology management;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2004. BIBE 2004. Proceedings. Fourth IEEE Symposium on
  • Print_ISBN
    0-7695-2173-8
  • Type

    conf

  • DOI
    10.1109/BIBE.2004.1317378
  • Filename
    1317378