• DocumentCode
    2370732
  • Title

    Mining frequent itemsets in distributed and dynamic databases

  • Author

    Otey, M.E. ; Wang, C. ; Parthasarathy, S. ; Veloso, A. ; Meira, W., Jr.

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
  • fYear
    2003
  • fDate
    19-22 Nov. 2003
  • Firstpage
    617
  • Lastpage
    620
  • Abstract
    Traditional methods for frequent itemset mining typically assume that data is centralized and static. Such methods impose excessive communication overhead when data is distributed, and they waste computational resources when data is dynamic. We present what we believe to be the first unified approach that overcomes these assumptions. Our approach makes use of parallel and incremental techniques to generate frequent itemsets in the presence of data updates without examining the entire database, and imposes minimal communication overhead when mining distributed databases. Further, our approach is able to generate both local and global frequent itemsets. This ability permits our approach to identify high-contrast frequent itemsets, which allows one to examine how the data is skewed over different sites.
  • Keywords
    data mining; distributed databases; minimisation; parallel algorithms; query processing; communication overhead minimization; distributed databases; dynamic databases; frequent itemsets mining; incremental techniques; parallel techniques; query response time; Computer networks; Computer science; Data mining; Distributed computing; Distributed databases; Frequency; Information science; Itemsets; Parallel algorithms; Transaction databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
  • Print_ISBN
    0-7695-1978-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2003.1250991
  • Filename
    1250991