• DocumentCode
    2019259
  • Title

    All Common Embedded Subtrees for Measuring Tree Similarity

  • Author

    Lin, Zhiwei ; Wang, Hui ; Mcclean, Sally ; Liu, Chang

  • Author_Institution
    Fac. of Comput. & Eng., Univ. of Ulster, Belfast
  • Volume
    1
  • fYear
    2008
  • fDate
    17-18 Oct. 2008
  • Firstpage
    29
  • Lastpage
    32
  • Abstract
    Tree similarity measurement is key to tree-like data mining. In order to maximally capture common information between trees, we consider the problem of computing all common embedded subtrees, and advocate using the number/count of all common embedded subtrees as a measure of similarity. This problem is not trivial due to the inherent complexity of trees and the ensued large search space. The problem is theoretically analyzed and an effective algorithm for counting all common embedded subtrees is presented. Experimental evaluation shows that the all common embedded subtree similarity is very competitive against tree edit distance, in terms of both efficiency and effectiveness.
  • Keywords
    computational complexity; data mining; search problems; tree data structures; trees (mathematics); all common embedded subtree; computational complexity; data mining; search space; tree similarity measurement; Algorithm design and analysis; Computational intelligence; Data engineering; Data mining; Design engineering; Embedded computing; Polynomials; Sequences; Text mining; Tree data structures; all common embedded subtree; tree edit distance; tree similarity measurement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Design, 2008. ISCID '08. International Symposium on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-0-7695-3311-7
  • Type

    conf

  • DOI
    10.1109/ISCID.2008.20
  • Filename
    4725550