DocumentCode :
3139689
Title :
An Efficient Algorithm to Mine Unordered Trees
Author :
Li, Yun ; Guo, Xin ; Yuan, Yunhao ; Wu, Jia ; Chen, Ling
Author_Institution :
Inst. of Inf. Eng., Yangzhou Univ., Yangzhou, China
fYear :
2009
fDate :
1-3 June 2009
Firstpage :
331
Lastpage :
336
Abstract :
Mining unordered trees is very useful in domains such as XML data, biological information, and Web structure. In this paper, we introduce UTMiner (unordered trees miner), an efficient algorithm for this task. Because the trees are unordered, the same subtree can appear in several equivalent forms; to avoid mining it more than once, an efficient unordered tree standardization is first introduced to transform the unordered trees into standardized subtrees. UTMiner is then used to obtain all standardized subtrees. UTMiner builds a multilayered data structure based on a subtree vector and a hash table, which reduces the time spent on isomorphism checking during mining. It requires only one database scan, which reduces scanning time and improves efficiency, particularly when mining a large database. Experiments show that UTMiner is feasible and more efficient than other algorithms.
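The abstract does not spell out the standardization or the multilayered structure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a tree is a hypothetical `(label, children)` pair with labels free of parentheses, uses a sorted-children string encoding as the standard form, and counts only complete bottom-up subtrees in a plain hash table during a single pass over the database. The names `canonical` and `count_bottom_up_subtrees` are illustrative.

```python
from collections import Counter

# Assumption: a tree is a (label, children) pair, where children is a list
# of trees and labels contain no parentheses.


def canonical(tree):
    """Return an order-independent string encoding of an unordered tree.

    Children are encoded recursively and then sorted, so two trees that
    differ only in sibling order get the same encoding.
    """
    label, children = tree
    return "(" + label + "".join(sorted(canonical(c) for c in children)) + ")"


def count_bottom_up_subtrees(database):
    """Count, in one pass over the database, how many trees contain each
    complete (bottom-up) subtree, keyed by its canonical encoding."""
    support = Counter()
    for tree in database:
        seen = set()  # count each distinct subtree once per database tree
        stack = [tree]
        while stack:
            node = stack.pop()
            seen.add(canonical(node))
            stack.extend(node[1])
        support.update(seen)
    return support


# Two trees that differ only in the order of the root's children.
t1 = ("a", [("b", []), ("c", [("d", [])])])
t2 = ("a", [("c", [("d", [])]), ("b", [])])

support = count_bottom_up_subtrees([t1, t2])
print(canonical(t1) == canonical(t2))
print(support["(c(d))"])
```

Because both trees map to the same encoding, a hash lookup replaces a pairwise isomorphism test. The sketch recomputes encodings for every node, which a real miner would cache.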
Keywords :
data mining; tree data structures; very large databases; UTMiner; database scanning; hash table; isomorphism time; large database; multilayered data structure; standardized subtrees; subtree vector; unordered trees miner; unordered trees standardization; Biology; Data mining; Data structures; Databases; Information science; RNA; Standardization; Tree data structures; Tree graphs; XML; data mining; frequent subtree; unordered tree;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3641-5
Type :
conf
DOI :
10.1109/ICIS.2009.133
Filename :
5222888