Title :
Multi-class Protein Sequence Classification Using Fuzzy ARTMAP
Author :
Mohamed, Shakir ; Rubin, David ; Marwala, Tshilidzi
Author_Institution :
Univ. of the Witwatersrand, Johannesburg
Abstract :
The classification of protein sequences into families is an important tool in the annotation of structural and functional properties to newly discovered proteins. We present a classification system using pattern recognition techniques to create a numerical vector representation of a protein sequence and then classify the sequence into a number of given families. We introduce the use of fuzzy ARTMAP classifiers and show that coupled with the genetic algorithm based feature subset selection, the system is able to classify protein sequences with an accuracy of 93%. This accuracy is compared with numerous other classification tools and demonstrates that the fuzzy ARTMAP is suitable due to its high accuracy, quick training times and ability for incremental learning.
Keywords :
biology computing; genetic algorithms; pattern classification; proteins; fuzzy ARTMAP classifiers; genetic algorithm; multiclass protein sequence classification; pattern recognition techniques; protein sequences classification; Amino acids; Cybernetics; Databases; Fuzzy systems; Genetic algorithms; Humans; Pattern recognition; Protein engineering; Protein sequence; Testing;
Conference_Titel :
Systems, Man and Cybernetics, 2006. SMC '06. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
1-4244-0099-6
Electronic_ISBN :
1-4244-0100-3
DOI :
10.1109/ICSMC.2006.384960