Title :
More multiple worlds evolution for motif discovery
Author_Institution :
Sch. of Comput. Sci., Univ. of Guelph, Guelph, ON, Canada
Abstract :
The Multiple Worlds Model of evolution is a spatially structured evolutionary algorithm which uses the ideas of Darwin´s finches as a motivating idea. Through multiple populations separated genetically but with a unified fitness evaluation, Multiple Worlds acts to partition data via specialization. Each of the populations must specialize in order to gain fitness, or a population can be reduced to little or no fitness. Such a drop in fitness implies that the number of populations, each representing a class in the data, is too large. The number of natural classes in the data is discovered by the algorithm via an analog to biological extinction. This study examines the application of this method to discovery of degenerate motifs on two types of data. The first is the classification of synthetic motifs, created by a self-driving finite state machine, selected to yield high-entropy data. The second is a biological example comprising two classes of data drawn from a Human Leukocyte Antigen data set. The classifiers found not only allow for the division of the data, but are expressed as degenerate motifs; granting researchers a comprehensible, reusable result.
Keywords :
bioinformatics; cellular biophysics; classification; entropy; Darwin finches; algorithm; analog; bioinformatics; biological extinction; classification; high-entropy data; human leukocyte antigen data set; motif discovery; self-driving finite state machine; specialization; structured evolutionary algorithm; Biological system modeling; Evolution (biology); Evolutionary computation; Indexes; Sociology; Statistics; Bioinformics; Evolutionary algorithms; Motif finding; Multiple Worlds Model; Planted Motif;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2013 IEEE Symposium on
Conference_Location :
Singapore
DOI :
10.1109/CIBCB.2013.6595404