DocumentCode :
2319378
Title :
Multiple worlds model for motif discovery
Author :
Brown, Joseph Alexander
Author_Institution :
Sch. of Comput. Sci., Univ. of Guelph, Guelph, ON, Canada
fYear :
2012
fDate :
9-12 May 2012
Firstpage :
92
Lastpage :
99
Abstract :
In this study we look at a novel evolutionary technique known as the multiple worlds model for unsupervised classification of sequences via evolved motifs. This evolutionary algorithm uses the biological inspirations of species and species extinction as features modeled in the evolution in order to provide classifiers where the number of classes is not known a priori. In the multiple worlds model a number of populations which do not interbreed compete in fitness evaluation. Sequence motifs are small, biologically significant DNA/RNA or amino acid segments. They are represented as strings of symbols and wild cards. The model works well to locate classification motifs for sequences whose classes have statistical deviations such as those with a GC content difference or those created from differing Self-Driving Markov models. The creation of such classifiers will allow biologists to examine large sets of sequences in order to discover significant features.
Keywords :
DNA; Markov processes; RNA; bioinformatics; classification; evolutionary computation; molecular biophysics; pattern classification; DNA-RNA; amino acid segments; biological inspirations; biologists; classifiers; evolutionary algorithm; fitness evaluation; motif discovery; multiple worlds model; self-driving Markov models; sequence classification; sequence motifs; species extinction; statistical deviations; wild cards; Biological system modeling; DNA; Entropy; Indexes; Markov processes; RNA; Bioinformatics; Evolutionary algorithms; Motif finding; Multiple World´s Model; Planted Motif;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2012 IEEE Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-1190-8
Type :
conf
DOI :
10.1109/CIBCB.2012.6217216
Filename :
6217216
Link To Document :
بازگشت