• DocumentCode
    2399946
  • Title

    Itemset Mining on Indexed Data Blocks

  • Author

    Baralis, Elena ; Cerquitelli, Tania ; Chiusano, Silvia

  • Author_Institution
    Dipt. di Automatica e Informatica, Politecnico di Torino
  • fYear
    2006
  • fDate
    Sept. 2006
  • Firstpage
    820
  • Lastpage
    825
  • Abstract
    This paper presents a novel index, called I-Forest, to support data mining activities on evolving databases, whose content is periodically updated through insertion (or deletion) of data blocks. I-Forest allows the extraction of itemsets from transactional databases such as transactional data from large retail chains. Item, support and time constraints may be enforced during the extraction phase. The proposed index is a covering index that represents transactional blocks in a succinct form and allows different kinds of analysis (e.g., analyze quarterly data). During the creation phase no support constraint is enforced. Thus, the index provides a complete representation of the evolving data. The I-Forest index has been implemented Into the Post-greSQL open source DBMS and exploits its physical level access methods. Experiments have been run for both sparse and dense data distributions. The execution time of the frequent itemset extraction task exploiting the index is always comparable with and for low support threshold faster than the Prefix-Tree algorithm accessing static data on at file
  • Keywords
    data mining; database indexing; DBMS open source; I-Forest; Post-greSQL open source; Prefix-Tree algorithm; data block indexing; data distributions; data mining; itemset mining; transactional databases; Data mining; Frequency; Indexes; Information retrieval; Intelligent systems; Itemsets; Performance analysis; Time factors; Transaction databases; Web server; Algorithms; Itemset Extraction; Performance; Relational DBMS;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, 2006 3rd International IEEE Conference on
  • Conference_Location
    London
  • Print_ISBN
    1-4244-01996-8
  • Electronic_ISBN
    1-4244-01996-8
  • Type

    conf

  • DOI
    10.1109/IS.2006.348526
  • Filename
    4155533