DocumentCode
2698278
Title
A fast algorithm of mining induced subtrees
Author
Li, Yun ; Guo, Xin ; Yuan, Yunhao ; Wu, Jia ; Chen, Ling
Author_Institution
Inst. of Inf. Eng., Yangzhou Univ., Yangzhou
fYear
2008
fDate
20-23 June 2008
Firstpage
195
Lastpage
199
Abstract
Induced subtrees mining is of important researching value in the fields of XML documents, bioinformatics, web log and so forth. In this paper, the two conceptions of subtree vector and pruning threshold are proposed, and an algorithm ITMSV (induced subtrees mining based on subtree vector) is presented to discover frequent induced subtrees quickly by taking full advantages of the features of subtree vector and combining with the hash table. The algorithm, as a result of constructing a multi-layered data structure, can lessen the time of distinguishing isomorphism during mining, and need scan database only once so that it induces times of scanning and improves the efficiency of algorithm. The experimental result shows that the algorithm ITMSV is more efficient and effective than TreeMiner.
Keywords
XML; cryptography; data mining; tree data structures; ITMSV; TreeMiner; Web log; XML documents; bioinformatics; hash table; induced subtrees mining; multilayered data structure; pruning threshold; subtree vector; Automation; Bioinformatics; Data mining; Data structures; RNA; Sequences; Spatial databases; Tree data structures; Tree graphs; XML; data mining; frequent subtree; induced subtree;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Automation, 2008. ICIA 2008. International Conference on
Conference_Location
Changsha
Print_ISBN
978-1-4244-2183-1
Electronic_ISBN
978-1-4244-2184-8
Type
conf
DOI
10.1109/ICINFA.2008.4607995
Filename
4607995
Link To Document