• DocumentCode
    2698278
  • Title

    A fast algorithm of mining induced subtrees

  • Author

    Li, Yun ; Guo, Xin ; Yuan, Yunhao ; Wu, Jia ; Chen, Ling

  • Author_Institution
    Inst. of Inf. Eng., Yangzhou Univ., Yangzhou
  • fYear
    2008
  • fDate
    20-23 June 2008
  • Firstpage
    195
  • Lastpage
    199
  • Abstract
    Mining induced subtrees is of significant research value in fields such as XML documents, bioinformatics, and web logs. This paper proposes two concepts, the subtree vector and the pruning threshold, and presents an algorithm, ITMSV (induced subtrees mining based on subtree vector), that discovers frequent induced subtrees quickly by taking full advantage of the features of the subtree vector in combination with a hash table. By constructing a multi-layered data structure, the algorithm lessens the time spent on isomorphism testing during mining and needs to scan the database only once, which reduces the number of scans and improves efficiency. Experimental results show that ITMSV is more efficient and effective than TreeMiner.
  • Keywords
    XML; cryptography; data mining; tree data structures; ITMSV; TreeMiner; Web log; XML documents; bioinformatics; hash table; induced subtrees mining; multilayered data structure; pruning threshold; subtree vector; Automation; Data structures; RNA; Sequences; Spatial databases; Tree graphs; frequent subtree; induced subtree
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information and Automation, 2008. ICIA 2008. International Conference on
  • Conference_Location
    Changsha
  • Print_ISBN
    978-1-4244-2183-1
  • Electronic_ISBN
    978-1-4244-2184-8
  • Type

    conf

  • DOI
    10.1109/ICINFA.2008.4607995
  • Filename
    4607995