DocumentCode
2019259
Title
All Common Embedded Subtrees for Measuring Tree Similarity
Author
Lin, Zhiwei ; Wang, Hui ; Mcclean, Sally ; Liu, Chang
Author_Institution
Fac. of Comput. & Eng., Univ. of Ulster, Belfast
Volume
1
fYear
2008
fDate
17-18 Oct. 2008
Firstpage
29
Lastpage
32
Abstract
Tree similarity measurement is key to tree-like data mining. In order to maximally capture common information between trees, we consider the problem of computing all common embedded subtrees, and advocate using the number/count of all common embedded subtrees as a measure of similarity. This problem is not trivial due to the inherent complexity of trees and the ensued large search space. The problem is theoretically analyzed and an effective algorithm for counting all common embedded subtrees is presented. Experimental evaluation shows that the all common embedded subtree similarity is very competitive against tree edit distance, in terms of both efficiency and effectiveness.
Keywords
computational complexity; data mining; search problems; tree data structures; trees (mathematics); all common embedded subtree; computational complexity; data mining; search space; tree similarity measurement; Algorithm design and analysis; Computational intelligence; Data engineering; Data mining; Design engineering; Embedded computing; Polynomials; Sequences; Text mining; Tree data structures; all common embedded subtree; tree edit distance; tree similarity measurement;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Design, 2008. ISCID '08. International Symposium on
Conference_Location
Wuhan
Print_ISBN
978-0-7695-3311-7
Type
conf
DOI
10.1109/ISCID.2008.20
Filename
4725550
Link To Document