• DocumentCode
    2194146
  • Title

    A Sketch-Based Architecture for Mining Frequent Items and Itemsets from Distributed Data Streams

  • Author

    Cesario, Eugenio ; Grillo, Antonio ; Mastroianni, Carlo ; Talia, Domenico

  • Author_Institution
    ICAR-CNR, Rende, Italy
  • fYear
    2011
  • fDate
    23-26 May 2011
  • Firstpage
    245
  • Lastpage
    253
  • Abstract
    This paper presents the design and the implementation of an architecture for the analysis of data streams in distributed environments. In particular, data stream analysis has been carried out for the computation of items and item sets that exceed a frequency threshold. The mining approach is hybrid, that is, frequent items are calculated with a single pass, using a sketch algorithm, while frequent item sets are calculated by a further multi-pass analysis. The architecture combines parallel and distributed processing to keep the pace with the rate of distributed data streams. In order to keep computation close to data, miners are distributed among the domains where data streams are generated. The paper also reports the experimental results obtained with a prototype of the architecture, tested on a Grid composed of two domains handling two different data streams.
  • Keywords
    data analysis; data mining; distributed processing; data stream analysis; distributed data streams; distributed processing; frequent item sets; mining approach; sketch-based architecture; Accuracy; Algorithm design and analysis; Computer architecture; Data mining; Distributed databases; Itemsets; Radiation detectors; Grids; distributed data mining; frequent items; frequent itemsets; stream mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cluster, Cloud and Grid Computing (CCGrid), 2011 11th IEEE/ACM International Symposium on
  • Conference_Location
    Newport Beach, CA
  • Print_ISBN
    978-1-4577-0129-0
  • Electronic_ISBN
    978-0-7695-4395-6
  • Type

    conf

  • DOI
    10.1109/CCGrid.2011.45
  • Filename
    5948615