Title :
PhospredRF: Prediction of protein phosphorylation sites using a consensus of random forest classifiers
Author :
Sagnik Banerjee;Subhadip Basu;Debjyoti Ghosh;Mita Nasipuri
Author_Institution :
Department of Electronics and Communication Engineering, Institute of Engineering and Management, Kolkata, India
Abstract :
Post translational modification (PTM) is a process by which proteins undergo chemical changes after they are translated from RNA. Among the various types of PTM, phosphorylation is the most important one since it assists in almost all the activities of the cell. In this research work we have used machine learning based approaches to predict the position where phosphorylation has occurred. Random forest has been used as the machine learning tool for this work. As features we have used evolutionary information extracted from Position Specific Scoring Matrices (PSSM). When tested with an independent set of 141 proteins our system achieved an AUC of 0.699. Also our system could attain the best performance for a set of 22 non-trivial proteins.
Keywords :
"Proteins","Amino acids","Feature extraction","Entropy","Training","Databases","Redundancy"
Conference_Titel :
Computing and Communication (IEMCON), 2015 International Conference and Workshop on
DOI :
10.1109/IEMCON.2015.7344514