Title :
Mining frequent itemsets in distributed and dynamic databases
Author :
Otey, M.E. ; Wang, C. ; Parthasarathy, S. ; Veloso, A. ; Meira, W., Jr.
Author_Institution :
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
Abstract :
Traditional methods for frequent itemset mining typically assume that data is centralized and static. Such methods impose excessive communication overhead when data is distributed, and they waste computational resources when data is dynamic. We present what we believe to be the first unified approach that overcomes these assumptions. Our approach makes use of parallel and incremental techniques to generate frequent itemsets in the presence of data updates without examining the entire database, and imposes minimal communication overhead when mining distributed databases. Further, our approach is able to generate both local and global frequent itemsets. This ability permits our approach to identify high-contrast frequent itemsets, which allows one to examine how the data is skewed over different sites.
Keywords :
data mining; distributed databases; minimisation; parallel algorithms; query processing; communication overhead minimization; distributed databases; dynamic databases; frequent itemsets mining; incremental techniques; parallel techniques; query response time; Computer networks; Computer science; Data mining; Distributed computing; Distributed databases; Frequency; Information science; Itemsets; Parallel algorithms; Transaction databases;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250991