Title :
Benchmarking and analysis of DNA-binding site prediction using machine learning
Abstract :
We benchmarked the performance of machine learning based publicly available Web servers, predicting DNA-binding residues. A blind test on data sets derived from protein-DNA complexes submitted to PDB after the publication of these Web servers was performed. It was discerned that models trained on unusually large number of parameters show exaggerated performance during training, which could not be sustained on new proteins submitted after these publications. Also discussed are the optimum definition of binding site and a correspondence between residue propensity and its bias for predictability in positive and negative class.
Keywords :
DNA; Internet; biocomputing; biology computing; learning (artificial intelligence); DNA-binding site prediction; Web servers; benchmarking; machine learning; Machine learning; Neural networks;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4634034