DocumentCode
167344
Title
Exploring Large Scale Receptor-Ligand Pairs in Molecular Docking Workflows in HPC Clouds
Author
Ocana, K. ; Benza, S. ; De Oliveira, D. ; Dias, J. ; Mattoso, M.
Author_Institution
COPPE, Fed. Univ. of Rio de Janeiro - UFRJ, Rio de Janeiro, Brazil
fYear
2014
fDate
19-23 May 2014
Firstpage
536
Lastpage
545
Abstract
Computer-aided drug design techniques are important assets in pharmaceutical industry because of their support for research and development of new drugs. Molecular docking (MD) predicts specific compound´s binding modes within the active site of target proteins. Since MD is a time-consuming process, existing approaches reduce the number of receptors or ligands in docking by evaluating only small sets of compounds. This restriction in the search space reduces the chances to uniformly cover the diverse space of compounds and misses opportunities to recognize whether new drugs can be identified. Another difficulty with large-scale is analyzing the results, e.g. browsing all directories manually to find which pairs were docked successfully. To address these issues we explored the potential of data provenance analysis and parallel processing of SciCumulus, a cloud Scientific Workflow Management System. We present SciDock, a molecular docking-based virtual screening workflow and evaluate its execution using 10,000 receptor-ligand pairs related to proteases enzymes of protozoan genomes. The overall performance of SciDock using 32 cores, in cloud virtual machines, reaches improvements up to 95.4% when running SciDock with AutoDock and 96.1% when running SciDock with Vina. We show how data provenance improved the result analysis and how it may indicate potential proteases drug targets for protozoan treatments.
Keywords
CAD; cloud computing; drugs; enzymes; genomics; medical computing; molecular biophysics; parallel processing; pharmaceutical industry; HPC clouds; Large Scale Receptor-Ligand Pairs; SciCumulus; SciDock; cloud scientific workflow management system; cloud virtual machines; compound binding modes; computer-aided drug design techniques; data provenance analysis; enzymes; molecular docking-based virtual screening workflow; parallel processing; pharmaceutical industry; protozoan genomes; search space; time-consuming process; Complexity theory; Compounds; Databases; Drugs; Engines; Parallel processing; Proteins; cloud; drug discovery; workflow;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing Symposium Workshops (IPDPSW), 2014 IEEE International
Conference_Location
Phoenix, AZ
Print_ISBN
978-1-4799-4117-9
Type
conf
DOI
10.1109/IPDPSW.2014.65
Filename
6969433
Link To Document