DocumentCode :
1422336
Title :
OInduced: An Efficient Algorithm for Mining Induced Patterns From Rooted Ordered Trees
Author :
Chehreghani, Mostafa Haghir ; Chehreghani, Morteza Haghir ; Lucas, Caro ; Rahgozar, Masoud
Author_Institution :
Sch. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
Volume :
41
Issue :
5
fYear :
2011
Firstpage :
1013
Lastpage :
1025
Abstract :
Frequent tree patterns have many practical applications in different domains, such as Extensible Markup Language mining, Web usage analysis, etc. In this paper, we present OInduced , which is a novel and efficient algorithm for finding frequent ordered induced tree patterns. OInduced uses a breadth-first candidate generation method and improves it by means of an indexing scheme. We also introduce frequency counting using tree encoding. For this purpose, we present two novel tree encodings, namely, m-coding and cm-coding, and show how they can restrict nodes of input trees and compute frequencies of generated candidates. We perform extensive experiments on both real and synthetic data sets to show the efficiency and scalability of OInduced.
Keywords :
data mining; pattern clustering; tree codes; trees (mathematics); OInduced; cm-coding; indexing; induced pattern mining; rooted ordered trees; tree encoding; Algorithm design and analysis; Data mining; Data structures; Encoding; Indexing; Breadth-first candidate generation; frequency counting; frequent tree pattern; induced subtree; rooted ordered labeled tree; tree encoding;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on
Publisher :
ieee
ISSN :
1083-4427
Type :
jour
DOI :
10.1109/TSMCA.2010.2096808
Filename :
5682417
Link To Document :
بازگشت