• DocumentCode
    392441
  • Title

    MotifMiner: a general toolkit for efficiently identifying common substructures in molecules

  • Author

    Coatney, Matt ; Parthasarathy, Srinivasan

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
  • fYear
    2003
  • fDate
    10-12 March 2003
  • Firstpage
    336
  • Lastpage
    340
  • Abstract
    Scientific research often involves examining structural relationships in molecules since scientists strongly believe in the causal relationship between structure and function. Traditionally, researchers have identified these patterns, or motifs, manually using biochemical expertise. However, with the massive influx of new biochemical data and the ability to gather data for very large molecules, there is great need for techniques that automatically and efficiently identify commonly occurring structural patterns in molecules. Previous automated substructure discovery approaches have each introduced variations of similar underlying techniques and have embedded domain knowledge. While doing so improves performance for the particular domain, this complicates extensibility to other domains. Also, they do not address scalability or noise, which is critical for certain structural domains like macromolecules. In this paper, we present MotifMiner, a general toolkit for automatically identifying common motifs in most any scientific molecular dataset. We describe both our application framework and services for identifying motifs, as well as demonstrate the flexibility of our system by analyzing several disparate domains, including protein, drug, and MD simulation datasets.
  • Keywords
    biochemistry; biology computing; molecular biophysics; molecular configurations; proteins; software tools; MD simulation datasets; MotifMiner; automated substructure discovery approaches; biochemical data; biochemical expertise; common molecular substructures identification; drug; embedded domain knowledge; identifying motifs; scalability; Analytical models; Bioinformatics; Chemical analysis; Drugs; Explosions; Information science; Isolation technology; Manuals; Proteins; Scalability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Bioengineering, 2003. Proceedings. Third IEEE Symposium on
  • Print_ISBN
    0-7695-1907-5
  • Type

    conf

  • DOI
    10.1109/BIBE.2003.1188971
  • Filename
    1188971