DocumentCode
2370732
Title
Mining frequent itemsets in distributed and dynamic databases
Author
Otey, M.E. ; Wang, C. ; Parthasarathy, S. ; Veloso, A. ; Meira, W., Jr.
Author_Institution
Dept. of Comput. & Inf. Sci., Ohio State Univ., Columbus, OH, USA
fYear
2003
fDate
19-22 Nov. 2003
Firstpage
617
Lastpage
620
Abstract
Traditional methods for frequent itemset mining typically assume that data is centralized and static. Such methods impose excessive communication overhead when data is distributed, and they waste computational resources when data is dynamic. We present what we believe to be the first unified approach that overcomes these assumptions. Our approach makes use of parallel and incremental techniques to generate frequent itemsets in the presence of data updates without examining the entire database, and imposes minimal communication overhead when mining distributed databases. Further, our approach is able to generate both local and global frequent itemsets. This ability permits our approach to identify high-contrast frequent itemsets, which allows one to examine how the data is skewed over different sites.
Keywords
data mining; distributed databases; minimisation; parallel algorithms; query processing; communication overhead minimization; distributed databases; dynamic databases; frequent itemsets mining; incremental techniques; parallel techniques; query response time; Computer networks; Computer science; Data mining; Distributed computing; Distributed databases; Frequency; Information science; Itemsets; Parallel algorithms; Transaction databases;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN
0-7695-1978-4
Type
conf
DOI
10.1109/ICDM.2003.1250991
Filename
1250991
Link To Document