DocumentCode :
2370732
Title :
Mining frequent itemsets in distributed and dynamic databases
Author :
Otey, M.E. ; Wang, C. ; Parthasarathy, S. ; Veloso, A. ; Meira, W., Jr.
Author_Institution :
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
617
Lastpage :
620
Abstract :
Traditional methods for frequent itemset mining typically assume that data is centralized and static. Such methods impose excessive communication overhead when data is distributed, and they waste computational resources when data is dynamic. We present what we believe to be the first unified approach that overcomes these assumptions. Our approach makes use of parallel and incremental techniques to generate frequent itemsets in the presence of data updates without examining the entire database, and imposes minimal communication overhead when mining distributed databases. Further, our approach is able to generate both local and global frequent itemsets. This ability permits our approach to identify high-contrast frequent itemsets, which allows one to examine how the data is skewed over different sites.
Keywords :
data mining; distributed databases; minimisation; parallel algorithms; query processing; communication overhead minimization; distributed databases; dynamic databases; frequent itemsets mining; incremental techniques; parallel techniques; query response time; Computer networks; Computer science; Data mining; Distributed computing; Distributed databases; Frequency; Information science; Itemsets; Parallel algorithms; Transaction databases;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250991
Filename :
1250991
Link To Document :
بازگشت