DocumentCode :
2399946
Title :
Itemset Mining on Indexed Data Blocks
Author :
Baralis, Elena ; Cerquitelli, Tania ; Chiusano, Silvia
Author_Institution :
Dipt. di Automatica e Informatica, Politecnico di Torino
fYear :
2006
fDate :
Sept. 2006
Firstpage :
820
Lastpage :
825
Abstract :
This paper presents a novel index, called I-Forest, to support data mining activities on evolving databases, whose content is periodically updated through insertion (or deletion) of data blocks. I-Forest allows the extraction of itemsets from transactional databases such as transactional data from large retail chains. Item, support and time constraints may be enforced during the extraction phase. The proposed index is a covering index that represents transactional blocks in a succinct form and allows different kinds of analysis (e.g., analyze quarterly data). During the creation phase no support constraint is enforced. Thus, the index provides a complete representation of the evolving data. The I-Forest index has been implemented Into the Post-greSQL open source DBMS and exploits its physical level access methods. Experiments have been run for both sparse and dense data distributions. The execution time of the frequent itemset extraction task exploiting the index is always comparable with and for low support threshold faster than the Prefix-Tree algorithm accessing static data on at file
Keywords :
data mining; database indexing; DBMS open source; I-Forest; Post-greSQL open source; Prefix-Tree algorithm; data block indexing; data distributions; data mining; itemset mining; transactional databases; Data mining; Frequency; Indexes; Information retrieval; Intelligent systems; Itemsets; Performance analysis; Time factors; Transaction databases; Web server; Algorithms; Itemset Extraction; Performance; Relational DBMS;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems, 2006 3rd International IEEE Conference on
Conference_Location :
London
Print_ISBN :
1-4244-01996-8
Electronic_ISBN :
1-4244-01996-8
Type :
conf
DOI :
10.1109/IS.2006.348526
Filename :
4155533
Link To Document :
بازگشت