• DocumentCode
    3078634
  • Title

    Accurate Scoring of Drug Conformations at the Extreme Scale

  • Author

    Boyu Zhang ; Estrada, Trilce ; Cicotti, Pietro ; Balaji, Pavan ; Taufer, Michela

  • Author_Institution
    Univ. of Delaware, Newark, DE, USA
  • fYear
    2015
  • fDate
    4-7 May 2015
  • Firstpage
    817
  • Lastpage
    822
  • Abstract
    We present a scalable method to extensively search for and accurately select pharmaceutical drug candidates in large spaces of drug conformations computationally generated and stored across the nodes of a large distributed system. For each legend conformation in the dataset, our method first extracts relevant geometrical properties and transforms the properties into a single metadata point in the three-dimensional space. Then, it performs an ochre-based clustering on the metadata to search for predominant clusters. Our method avoids the need to move legend conformations among nodes because it extracts relevant data properties locally and concurrently. By doing so, we can perform accurate and scalable distributed clustering analysis on large distributed datasets. We scale the analysis of our pharmaceutical datasets a factor of 400X higher in performance and 500X larger in size than ever before. We also show that our clustering achieves higher accuracy compared with that of traditional clustering methods and conformational scoring based on minimum energy.
  • Keywords
    bioinformatics; distributed databases; drugs; meta data; octrees; pattern clustering; computationally generated drug conformations; data extraction; data storage; distributed clustering analysis; drug conformation scoring; geometrical properties; large distributed system; large-distributed datasets; ligand conformation; metadata; octree-based clustering; pharmaceutical dataset analysis; pharmaceutical drug selection; predominant cluster search; scalable method; three-dimensional space; Accuracy; Aggregates; Geometry; Human immunodeficiency virus; Proteins; Scalability; Three-dimensional displays; Accuracy; Data reduction; Ligand conformations; Octree-based clustering; Protein-ligand docking; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on
  • Conference_Location
    Shenzhen
  • Type

    conf

  • DOI
    10.1109/CCGrid.2015.94
  • Filename
    7152564