DocumentCode :
2582739
Title :
A multi-level approach to SCOP fold recognition
Author :
Marsolo, Keith ; Parthasarathy, Srinivasan ; Ding, Chris
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
fYear :
2005
fDate :
19-21 Oct. 2005
Firstpage :
57
Lastpage :
64
Abstract :
The classification of proteins based on their structure can play an important role in the deduction or discovery of protein function. However, the relatively low number of solved protein structures and the unknown relationship between structure and sequence requires an alternative method of representation for classification to be effective. Furthermore, the large number of potential folds causes problems for many classification strategies, increasing the likelihood that the classifier will reach a local optima while trying to distinguish between all of the possible structural categories. Here we present a hierarchical strategy for structural classification that first partitions proteins based on their SCOP class before attempting to assign a protein fold. Using a well-known dataset derived from the 27 most-populated SCOP folds and several sequence-based descriptor properties as input features, we test a number of classification methods, including Naive Bayes and Boosted C4.5. Our strategy achieves an average fold recognition of 74%, which is significantly higher than the 56-60% previously reported in the literature, indicating the effectiveness of a multi-level approach.
Keywords :
Bayes methods; biology computing; molecular biophysics; molecular configurations; proteins; Boosted C4.5; Naive Bayes; SCOP fold recognition; hierarchical strategy; multi-level approach; protein function; protein sequence; protein structural classification; sequence-based descriptor properties; Bioinformatics; Biological processes; Computer science; Databases; Genomics; Laboratories; Protein engineering; Proteomics; Sequences; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering, 2005. BIBE 2005. Fifth IEEE Symposium on
Print_ISBN :
0-7695-2476-1
Type :
conf
DOI :
10.1109/BIBE.2005.5
Filename :
1544449
Link To Document :
بازگشت