DocumentCode :
3177554
Title :
Prediction of Catalytic Residues in Proteins Using a Consensus of Prediction (CoP) Approach
Author :
Petrova, Natalia V. ; Wu, Cathy H.
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Delaware, Newark, DE, USA
fYear :
2010
fDate :
May 31 2010-June 3 2010
Firstpage :
226
Lastpage :
231
Abstract :
One of the aims of the Protein Structure Initiative (PSI) in the post genome-sequencing era is to elucidate biochemical and biophysical functions of each protein structure. Thus, the development of new methods for a large-scale analysis/annotation of protein functional residues is inevitable. Currently existing methods are not capable to do so due to the lack of automation, availability, and/or poor performance. In our previous work we were able to improve the accuracy of the prediction to ~86%, although the number of false-positives remained high. In this paper we present a fully-automated method for the prediction of catalytic residues in proteins that improves accuracy by reduction of false-positives, and is applicable for a large-scale analysis. Here, catalytic residues are predicted by machine learning approach followed by hierarchical analysis of the predicted residues. The capability of the method was tested on diverse family of hydrolytic enzymes with a/b hydrolase fold with widely differing phylogenetic origins and catalytic functions. The method was executed manually and then fully reproduces automatically. In the manual analysis, in 17 enzymes, the method correctly predicted all 3 residues of the catalytic triad with 3 false-positives out of 282 residues on average. Our method successfully eliminates the number of false-positives, while being applicable for a large-scale analysis of the protein function.
Keywords :
biochemistry; biological techniques; biology computing; catalysis; enzymes; genomics; learning (artificial intelligence); molecular biophysics; molecular configurations; proteins; a-b hydrolase fold; biochemical function; biophysical function; catalytic residues; catalytic triad; consensus-of-prediction approach; diverse family; fully-automated method; genome-sequencing era; hierarchical analysis; hydrolytic enzymes; large-scale analysis; machine learning approach; phylogenetic origins; protein annotation; protein functional residues; protein structure initiative; Accuracy; Automation; Availability; Biochemistry; Bioinformatics; Genomics; Large-scale systems; Machine learning; Protein engineering; Testing; CoP; catalytic residue; conservation of prediction; large-scale analysis; machine leaning; prediction; protein function;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
BioInformatics and BioEngineering (BIBE), 2010 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4244-7494-3
Type :
conf
DOI :
10.1109/BIBE.2010.44
Filename :
5521686
Link To Document :
بازگشت