DocumentCode
428816
Title
Advanced pruning strategies to speed up mining closed molecular fragments
Author
Borgelt, Christian ; Meinl, Thorsten ; Berthold, Michael R.
Author_Institution
Sch. of Comput. Sci., Otto von Guericke Univ., Magdeburg, Germany
Volume
5
fYear
2004
fDate
10-13 Oct. 2004
Firstpage
4565
Abstract
In years, several algorithms for mining frequent subgraphs in graph databases have been proposed, with a major application area being the discovery of frequent substructures of biomolecules. Unfortunately, most of these algorithms still struggle with fairly long execution times if larger substructures or molecular fragments are desired. We describe two advanced pruning strategies - equivalent sibling pruning and perfect extension pruning - that can be used to speed up the MoFa algorithm (introduced in C. Borgelt and M.R. Berthold, (2002)) in the search for closed molecular fragments, as we demonstrate with experiments on the NCI´s HIV database.
Keywords
biochemistry; data mining; visual databases; MoFa algorithm; advanced pruning strategies; closed molecular fragment mining; equivalent sibling pruning; frequent subgraphs mining; graph databases; perfect extension pruning; Biochemistry; Computer science; Data mining; Drugs; Ear; Human immunodeficiency virus; Hydrogen; Information science; Protection; Tree graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2004 IEEE International Conference on
ISSN
1062-922X
Print_ISBN
0-7803-8566-7
Type
conf
DOI
10.1109/ICSMC.2004.1401251
Filename
1401251
Link To Document