• DocumentCode
    84174
  • Title

    A Robust Elicitation Algorithm for Discovering DNA Motifs Using Fuzzy Self-Organizing Maps

  • Author

    Dianhui Wang ; Tapan, Sarwar

  • Author_Institution
    Dept. of Comput. Sci. & Comput. Eng., La Trobe Univ., Melbourne, VIC, Australia
  • Volume
    24
  • Issue
    10
  • fYear
    2013
  • fDate
    Oct. 2013
  • Firstpage
    1677
  • Lastpage
    1688
  • Abstract
    It is important to identify DNA motifs in promoter regions to understand the mechanism of gene regulation. Computational approaches for finding DNA motifs are well recognized as useful tools to biologists, which greatly help in saving experimental time and cost in wet laboratories. Self-organizing maps (SOMs), as a powerful clustering tool, have demonstrated good potential for problem solving. However, the current SOM-based motif discovery algorithms unfairly treat data samples lying around the cluster boundaries by assigning them to one of the nodes, which may result in unreliable system performance. This paper aims to develop a robust framework for discovering DNA motifs, where fuzzy SOMs, with an integration of fuzzy c-means membership functions and a standard batch-learning scheme, are employed to extract putative motifs with varying length in a recursive manner. Experimental results on eight real datasets show that our proposed algorithm outperforms the other searching tools such as SOMBRERO, SOMEA, MEME, AlignACE, and WEEDER in terms of the F-measure and algorithm reliability. It is observed that a remarkable 24.6% improvement can be achieved compared to the state-of-the-art SOMBRERO. Furthermore, our algorithm can produce a 20% and 6.6% improvement over SOMBRERO and SOMEA, respectively, in finding multiple motifs on five artificial datasets.
  • Keywords
    DNA; biology computing; fuzzy set theory; genetics; learning (artificial intelligence); self-organising feature maps; DNA motifs; SOM; batch learning scheme; fuzzy c-means membership functions; fuzzy self-organizing maps; gene regulation; problem solving; robust elicitation algorithm; Clustering algorithms; Computational modeling; DNA; Measurement; Prototypes; Robustness; Training; Computational motif discovery; DNA sequences; fuzzy self-organizing maps; robust elicitation algorithm;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2013.2275733
  • Filename
    6579728