DocumentCode :
2698278
Title :
A fast algorithm of mining induced subtrees
Author :
Li, Yun ; Guo, Xin ; Yuan, Yunhao ; Wu, Jia ; Chen, Ling
Author_Institution :
Inst. of Inf. Eng., Yangzhou Univ., Yangzhou
fYear :
2008
fDate :
20-23 June 2008
Firstpage :
195
Lastpage :
199
Abstract :
Induced subtrees mining is of important researching value in the fields of XML documents, bioinformatics, web log and so forth. In this paper, the two conceptions of subtree vector and pruning threshold are proposed, and an algorithm ITMSV (induced subtrees mining based on subtree vector) is presented to discover frequent induced subtrees quickly by taking full advantages of the features of subtree vector and combining with the hash table. The algorithm, as a result of constructing a multi-layered data structure, can lessen the time of distinguishing isomorphism during mining, and need scan database only once so that it induces times of scanning and improves the efficiency of algorithm. The experimental result shows that the algorithm ITMSV is more efficient and effective than TreeMiner.
Keywords :
XML; cryptography; data mining; tree data structures; ITMSV; TreeMiner; Web log; XML documents; bioinformatics; hash table; induced subtrees mining; multilayered data structure; pruning threshold; subtree vector; Automation; Bioinformatics; Data mining; Data structures; RNA; Sequences; Spatial databases; Tree data structures; Tree graphs; XML; data mining; frequent subtree; induced subtree;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Automation, 2008. ICIA 2008. International Conference on
Conference_Location :
Changsha
Print_ISBN :
978-1-4244-2183-1
Electronic_ISBN :
978-1-4244-2184-8
Type :
conf
DOI :
10.1109/ICINFA.2008.4607995
Filename :
4607995
Link To Document :
بازگشت