Title :
Multiple worlds model for motif discovery
Author :
Brown, Joseph Alexander
Author_Institution :
Sch. of Comput. Sci., Univ. of Guelph, Guelph, ON, Canada
Abstract :
In this study we look at a novel evolutionary technique known as the multiple worlds model for unsupervised classification of sequences via evolved motifs. This evolutionary algorithm uses the biological inspirations of species and species extinction as features modeled in the evolution in order to provide classifiers where the number of classes is not known a priori. In the multiple worlds model a number of populations which do not interbreed compete in fitness evaluation. Sequence motifs are small, biologically significant DNA/RNA or amino acid segments. They are represented as strings of symbols and wild cards. The model works well to locate classification motifs for sequences whose classes have statistical deviations such as those with a GC content difference or those created from differing Self-Driving Markov models. The creation of such classifiers will allow biologists to examine large sets of sequences in order to discover significant features.
Keywords :
DNA; Markov processes; RNA; bioinformatics; classification; evolutionary computation; molecular biophysics; pattern classification; DNA-RNA; amino acid segments; biological inspirations; biologists; classifiers; evolutionary algorithm; fitness evaluation; motif discovery; multiple worlds model; self-driving Markov models; sequence classification; sequence motifs; species extinction; statistical deviations; wild cards; Biological system modeling; DNA; Entropy; Indexes; Markov processes; RNA; Bioinformatics; Evolutionary algorithms; Motif finding; Multiple World´s Model; Planted Motif;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2012 IEEE Symposium on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4673-1190-8
DOI :
10.1109/CIBCB.2012.6217216