DocumentCode
84174
Title
A Robust Elicitation Algorithm for Discovering DNA Motifs Using Fuzzy Self-Organizing Maps
Author
Dianhui Wang ; Tapan, Sarwar
Author_Institution
Dept. of Comput. Sci. & Comput. Eng., La Trobe Univ., Melbourne, VIC, Australia
Volume
24
Issue
10
fYear
2013
fDate
Oct. 2013
Firstpage
1677
Lastpage
1688
Abstract
It is important to identify DNA motifs in promoter regions to understand the mechanism of gene regulation. Computational approaches for finding DNA motifs are well recognized as useful tools to biologists, which greatly help in saving experimental time and cost in wet laboratories. Self-organizing maps (SOMs), as a powerful clustering tool, have demonstrated good potential for problem solving. However, the current SOM-based motif discovery algorithms unfairly treat data samples lying around the cluster boundaries by assigning them to one of the nodes, which may result in unreliable system performance. This paper aims to develop a robust framework for discovering DNA motifs, where fuzzy SOMs, with an integration of fuzzy c-means membership functions and a standard batch-learning scheme, are employed to extract putative motifs with varying length in a recursive manner. Experimental results on eight real datasets show that our proposed algorithm outperforms the other searching tools such as SOMBRERO, SOMEA, MEME, AlignACE, and WEEDER in terms of the F-measure and algorithm reliability. It is observed that a remarkable 24.6% improvement can be achieved compared to the state-of-the-art SOMBRERO. Furthermore, our algorithm can produce a 20% and 6.6% improvement over SOMBRERO and SOMEA, respectively, in finding multiple motifs on five artificial datasets.
Keywords
DNA; biology computing; fuzzy set theory; genetics; learning (artificial intelligence); self-organising feature maps; DNA motifs; SOM; batch learning scheme; fuzzy c-means membership functions; fuzzy self-organizing maps; gene regulation; problem solving; robust elicitation algorithm; Clustering algorithms; Computational modeling; DNA; Measurement; Prototypes; Robustness; Training; Computational motif discovery; DNA sequences; fuzzy self-organizing maps; robust elicitation algorithm;
fLanguage
English
Journal_Title
Neural Networks and Learning Systems, IEEE Transactions on
Publisher
ieee
ISSN
2162-237X
Type
jour
DOI
10.1109/TNNLS.2013.2275733
Filename
6579728
Link To Document