• DocumentCode
    848612
  • Title

    IMine: Index Support for Item Set Mining

  • Author

    Baralis, Elena ; Cerquitelli, Tania ; Chiusano, Silvia

  • Author_Institution
    Dipt. di Autom. e Inf., Politec. di Torino, Torino
  • Volume
    21
  • Issue
    4
  • fYear
    2009
  • fDate
    4/1/2009 12:00:00 AM
  • Firstpage
    493
  • Lastpage
    506
  • Abstract
    This paper presents the IMine index, a general and compact structure which provides tight integration of item set extraction in a relational DBMS. Since no constraint is enforced during the index creation phase, IMine provides a complete representation of the original database. To reduce the I/O cost, data accessed together during the same extraction phase are clustered on the same disk block. The IMine index structure can be efficiently exploited by different item set extraction algorithms. In particular, IMine data access methods currently support the FP-growth and LCM v.2 algorithms, but they can straightforwardly support the enforcement of various constraint categories. The IMine index has been integrated into the PostgreSQL DBMS and exploits its physical level access methods. Experiments, run for both sparse and dense data distributions, show the efficiency of the proposed index and its linear scalability also for large datasets. Item set mining supported by the IMine index shows performance always comparable with, and sometimes better than, state of the art algorithms accessing data on flat file.
  • Keywords
    data mining; database indexing; relational databases; FP-growth algorithm; IMine index; LCM v.2 algorithm; PostgreSQL DBMS; item set extraction; item set mining; relational DBMS; Association rules; Clustering algorithms; Costs; Data mining; Data structures; Indexes; Indexing; Relational databases; Scalability; Transaction databases; Data Mining; Indexing; Itemset Extraction;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.180
  • Filename
    4609383