• DocumentCode
    428816
  • Title

    Advanced pruning strategies to speed up mining closed molecular fragments

  • Author

    Borgelt, Christian ; Meinl, Thorsten ; Berthold, Michael R.

  • Author_Institution
    Sch. of Comput. Sci., Otto von Guericke Univ., Magdeburg, Germany
  • Volume
    5
  • fYear
    2004
  • fDate
    10-13 Oct. 2004
  • Firstpage
    4565
  • Abstract
    In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI´s HIV database.
  • Keywords
    biochemistry; data mining; visual databases; MoFa algorithm; advanced pruning strategies; closed molecular fragment mining; equivalent sibling pruning; frequent subgraphs mining; graph databases; perfect extension pruning; Biochemistry; Computer science; Data mining; Drugs; Ear; Human immunodeficiency virus; Hydrogen; Information science; Protection; Tree graphs;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2004 IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-8566-7
  • Type

    conf

  • DOI
    10.1109/ICSMC.2004.1401251
  • Filename
    1401251