DocumentCode
2528584
Title
Analysis of four different sets of predictive features for metalloproteins
Author
Seker, Huseyin ; Haris, Parvez I.
Author_Institution
Bio-Health Informatics Res. Group, De Montfort Univ., Leicester, UK
fYear
2005
fDate
8-11 Aug. 2005
Firstpage
228
Lastpage
229
Abstract
Metals bound to the protein are important for functional or structural roles. Despite their importance there is a distinct lack of research for identification of metalloproteins from sequence data and their predictive features that help distinguish them from non-metal binding proteins. In this study, four sets of features were analysed in order to see their ability to distinguish between metal and non-metal binding proteins. The analysis was carried out using a novel fuzzy logic method. The results show that the amino acid composition is more capable of distinguishing metal from non-metal binding proteins, than any of the other three features, yielding a predictive accuracy of 69.4%. Cofactors were the least useful feature for distinguishing metalloproteins. However, better results were obtained when physico-chemical and secondary structure features are used, yielding accuracies of 67.8% and 67.1%, respectively. Although the amino acid composition yields the highest predictive accuracy, considering the number of features, the latter two sets of features may be more appropriate for such analysis.
Keywords
biology computing; fuzzy logic; molecular biophysics; proteins; amino acid composition; fuzzy logic method; metal binding proteins; metal bound; metalloproteins; physico-chemical structure; protein; sequence data; Accuracy; Amino acids; Bioinformatics; Biological systems; Biomedical informatics; Computational intelligence; Fuzzy logic; Fuzzy sets; Proteins; Robustness;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2005. Workshops and Poster Abstracts. IEEE
Print_ISBN
0-7695-2442-7
Type
conf
DOI
10.1109/CSBW.2005.23
Filename
1540610
Link To Document