Title :
Alternative Method for Increnentally Constructing the FP-Tree
Author :
Adnan, Muhaimenul ; Alhajj, Reda ; Barker, Ken
Author_Institution :
Dept. of Comput. Sci., Calgary Univ., Alta.
Abstract :
The FP-tree is an effective data structure that facilitates the mining of frequent patterns from transactional databases. But, transactional databases are dynamic in general, and hence modifications on the database must be reflecting onto the FP-tree. Constructing the FP-tree from scratch and incrementally updating the FP-tree are two possible choices. However, from scratch construction turns unfeasible as the database size increases. So, this paper addresses incremental update by extending the FP-tree concepts and manipulation process. Our new approach is capable of handling all kinds of changes; include additions, deletions and modifications. The target is achieved by constructing and incrementally dealing with the complete FP-tree, i.e., with one minimum support. Constructing the complete FP-tree has the other advantage that it provides the freedom of mining for lower minimum support values without the need to reconstruct the tree. However, directly reflecting the changes onto the FP-tree may invalidate the basic FP-tree structure. Thus, we apply a sequence of shuffling and merging operations to validate and maintain the modified tree. The experiments conducted on synthetic and real datasets clearly highlight advantages of the proposed incremental approach over constructing the FP-tree from scratch
Keywords :
data mining; merging; tree data structures; FP-tree structure; data structure; dynamic databases; incremental mining; sequential patterns mining; transactional databases; Association rules; Computer science; Data analysis; Data mining; Data structures; Deductive databases; Intelligent structures; Intelligent systems; Itemsets; Transaction databases; FP-tree; dynamic databases; incremental mining; sequential patterns mining;
Conference_Titel :
Intelligent Systems, 2006 3rd International IEEE Conference on
Conference_Location :
London
Print_ISBN :
1-4244-01996-8
Electronic_ISBN :
1-4244-01996-8
DOI :
10.1109/IS.2006.348469