DocumentCode :
667221
Title :
Prediction of enzymatic activity of proteins based on structural and functional domains
Author :
Koutsandreas, Theodoros G. ; Pilalis, Eleftherios D. ; Chatziioannou, Aristotelis A.
Author_Institution :
Metabolic Eng. & Bioinf. Program, Nat. Hellenic Res. Found., Athens, Greece
fYear :
2013
fDate :
10-13 Nov. 2013
Firstpage :
1
Lastpage :
3
Abstract :
The prediction of the putative enzymatic function of uncharacterized proteins is a major problem in the field of metagenomic research, where large amounts of sequences can be rapidly determined. In this work a machine-learning approach was developed, that attempts the prediction of enzymatic activity based on three protein domain databases, PFAM, CATH and SCOP, which contain functional and structural information of proteins as Hidden Markov Models. Separate and combined classifiers were trained by well-annotated data and their performance was assessed in order to compare the predictive power of different attribute sets corresponding to the three protein domain databases. All classifiers performed well, with an average accuracy of ~96% and an average AUC score of 0.84. As a conclusion, the classification procedure can be integrated to more extended metagenomic analysis workflows.
Keywords :
biochemistry; biology computing; enzymes; genomics; hidden Markov models; learning (artificial intelligence); molecular biophysics; molecular configurations; pattern classification; AUC score; CATH; PFAM; SCOP; classification procedure; enzymatic activity; functional domains; functional information; hidden Markov models; machine-learning approach; metagenomic research; protein domain databases; putative enzymatic function; structural domains; structural information; Accuracy; Biotechnology; Databases; Hidden Markov models; Proteins; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location :
Chania
Type :
conf
DOI :
10.1109/BIBE.2013.6701559
Filename :
6701559
Link To Document :
بازگشت