• DocumentCode
    2094160
  • Title

    A Novel Approach to Extract Structured Motifs by Multi-Objective Genetic Algorithm

  • Author

    Kaya, Mehmet ; Guc, M.

  • Author_Institution
    Dept. of Comput. Eng., Firat Univ., Elazg
  • fYear
    2008
  • fDate
    17-19 June 2008
  • Firstpage
    278
  • Lastpage
    283
  • Abstract
    The functional motifs composed of several sequential blocks are difficult to find. Current mining methods might individually find each motif block but fail to connect them with large irregular gaps. In this paper we propose a novel method for the efficient extraction of structured motifs from DNA sequences using multi-objective genetic algorithm. The main advantage of our approach is that a large number of nondominated motifs can be obtained by a single run with respect to conflicting objectives: similarity and support maximization and gap minimization. To the best of our knowledge, this is the first effort in this direction. The proposed method can be applied to any data set with a sequential character. Furthermore, it allows any choice of similarity measures for finding motifs. By analyzing the obtained optimal motifs, the decision maker can understand the tradeoff between the objectives. We compare our method with the two well-known structured motif extraction methods, EXMOTIF and RISOTTO. Experimental results on synthetics data set demonstrate that the proposed method exhibits good performance over the other methods in terms of runtime.
  • Keywords
    DNA; biology computing; data mining; feature extraction; genetic algorithms; sequences; DNA sequences; data mining; multiobjective genetic algorithm; structured motifs extraction; Bioinformatics; Biomedical engineering; DNA; Data engineering; Data mining; Genetic algorithms; Genetic engineering; Laboratories; Runtime; Sequences; multi-objective genetic algorithm; structured motif discovery;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer-Based Medical Systems, 2008. CBMS '08. 21st IEEE International Symposium on
  • Conference_Location
    Jyvaskyla
  • ISSN
    1063-7125
  • Print_ISBN
    978-0-7695-3165-6
  • Type

    conf

  • DOI
    10.1109/CBMS.2008.99
  • Filename
    4562001