• DocumentCode
    3249468
  • Title

    Adaptive and resource-aware mining of frequent sets

  • Author

    Orlando, S. ; Palmerini, P. ; Perego, R. ; Silvestri, F.

  • Author_Institution
    Dipt. di Informatica, Universita Ca´´ Foscari, Venezia, Italy
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    338
  • Lastpage
    345
  • Abstract
    The performance of an algorithm that mines frequent sets from transactional databases may severely depend on the specific features of the data being analyzed. Moreover, some architectural characteristics of the computational platform used - e.g. the available main memory - can dramatically change its runtime behavior. In this paper we present DCI (Direct Count & Intersect), an efficient algorithm for discovering frequent sets from large databases. Due to the multiple heuristics strategies adopted, DCI can adapt its behavior not only to the features of the specific computing platform, but also to the features of the dataset being mined, so that it results very effective in mining both short and long patterns from sparse and dense datasets. Finally we also discuss the parallelization strategies adopted in the design of ParDCI, a distributed and multi-threaded implementation of DCI.
  • Keywords
    data mining; database management systems; DCl; ParDCl; adaptive mining; frequent sets; large databases; multiple heuristics strategies; resource-aware mining; transactional databases; Algorithm design and analysis; Association rules; Data analysis; Data mining; Itemsets; Performance analysis; Power capacitors; Runtime; Spatial databases; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1183921
  • Filename
    1183921