Title :
A Novel Scoring Based Distributed Protein Docking Application to Improve Enrichment
Author :
Pradeep, Prachi ; Struble, Craig ; Neumann, Terrence ; Sem, Daniel S. ; Merrill, Stephen J.
Author_Institution :
Dept. of Math., Stat., & Comput. Sci., Marquette Univ., Marquette, WI, USA
Abstract :
Molecular docking is a computational technique which predicts the binding energy and the preferred binding mode of a ligand to a protein target. Virtual screening is a tool which uses docking to investigate large chemical libraries to identify ligands that bind favorably to a protein target. We have developed a novel scoring based distributed protein docking application to improve enrichment in virtual screening. The application addresses the issue of time and cost of screening in contrast to conventional systematic parallel virtual screening methods in two ways. Firstly, it automates the process of creating and launching multiple independent dockings on a high performance computing cluster. Secondly, it uses a Nȧive Bayes scoring function to calculate binding energy of un-docked ligands to identify and preferentially dock (Autodock predicted) better binders. The application was tested on four proteins using a library of 10,573 ligands. In all the experiments, (i). 200 of the 1,000 best binders are identified after docking only ~14 percent of the chemical library, (ii). 9 or 10 best-binders are identified after docking only ~19 percent of the chemical library, and (iii). no significant enrichment is observed after docking ~70 percent of the chemical library. The results show significant increase in enrichment of potential drug leads in early rounds of virtual screening.
Keywords :
Bayes methods; binding energy; biology computing; molecular biophysics; proteins; Naive Bayes scoring function; binding energy; chemical libraries; enrichment; high performance computing cluster; ligand; molecular docking; scoring based distributed protein docking; virtual screening; Bioinformatics; Chemicals; Computational biology; Drugs; Proteins; Distributed Protein Docking; HTCondor; High Performance Computing; N??i??ve Bayes; Naive Bayes; Scoring Function; Scoring function; Virtual Screening; Virtual screening; distributed protein docking; high performance computing;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2015.2401020