DocumentCode :
492504
Title :
Extraction of Frequent Tree Patterns without Subtrees Maintenance
Author :
Paik, Juryon ; Choi, WonGil ; Lee, Eunjoo ; Kim, Ung Mo
Author_Institution :
Dept. of Comput. Eng., Sungkyunkwan Univ., Suwon
Volume :
2
fYear :
2008
fDate :
13-15 Dec. 2008
Firstpage :
54
Lastpage :
59
Abstract :
The inherent flexibility in both structure and semantics let tree capture most kinds of data, model a wide variety of data sources, and produce an enormous number of information. The ability to extract valuable knowledge from them becomes increasingly important and desirable, however, existing tree mining algorithms suffer from several serious pitfalls in finding frequent patterns from massive tree datasets, because most of them have used a priori property for candidate generation and frequency counting. Some of the major problems are due to (1) modeling data as hierarchical tree structure, (2) computationally high cost of the candidate maintenance, (3) repetitious input dataset scans, and (4) the high memory dependency. Therefore, a more efficient and practical approach for tree data is required. In this paper, we systematically develop the pattern growth method instead of the a priori method, for mining maximal frequent tree patterns which are special frequent patterns of a set of trees. The proposed method not only gets rid of the process for infrequent subtrees pruning, but also totally eliminates the problem of generating candidate subtrees. Hence, it significantly improves the whole mining process.
Keywords :
data mining; data models; tree data structures; candidate generation; candidate maintenance; data modeling; frequency counting; hierarchical tree structure; knowledge extraction; massive tree dataset; maximal frequent tree pattern extraction; memory dependency; pattern growth method; subtree maintenance; tree mining; Computational efficiency; Computer networks; Conferences; Data engineering; Data mining; Databases; Frequency; Pervasive computing; Testing; Tree data structures; Frequent subtrees; Pattern-growth; Tree Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Future Generation Communication and Networking Symposia, 2008. FGCNS '08. Second International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-3430-5
Electronic_ISBN :
978-0-7695-3546-3
Type :
conf
DOI :
10.1109/FGCNS.2008.70
Filename :
4813521
Link To Document :
بازگشت