Title :
Mining Maximal Embedded Unordered Tree Patterns
Author :
Chehreghani, Mostafa Haghir ; Rahgozar, Masoud ; Lucas, Craig
Author_Institution :
Fac. of Electron. in Commun. Eng., Tehran Univ.
fDate :
March 1 2007-April 5 2007
Abstract :
Mining frequent tree patterns has many practical applications in areas such as XML document mining, Web mining, bioinformatics, network routing and so on. Most of the previous works used an apriori-based approach for candidate generation and frequency counting in their algorithms. In these approaches the state space grows exponentially since many unreal candidates are generated, especially when there are lots of large patterns among the data. To tackle these problems, we propose TDU, a top-down approach for mining all maximal, labeled, unordered, and embedded subtrees from a collection of tree-structured data. We would evaluate the effectiveness of the TDU algorithm in comparison to the previous works
Keywords :
data mining; tree data structures; apriori-based approach; candidate generation; frequency counting; maximal embedded unordered tree pattern mining; top-down approach; tree-structured data; Association rules; Bioinformatics; Computational intelligence; Data mining; Encoding; Frequency; Lattices; Routing; Web mining; XML;
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
DOI :
10.1109/CIDM.2007.368907