DocumentCode :
3455717
Title :
Progressive Clustering for Database Distribution on a Grid
Author :
Fiolet, Valerie ; Toursel, Bernard
Author_Institution :
Mons-Hainault Univ., Mons
fYear :
2005
fDate :
4-6 July 2005
Firstpage :
282
Lastpage :
289
Abstract :
The increasing availability of clusters and grids of workstations provides cheap and powerful resources for distributed data mining. To exploit these resources we need new algorithms adapted to this kind of environment, in particular with respect to the way to fragment data and to use this fragmentation. An "intelligent" distribution of data is required and can be obtained from clustering. Most existing parallel methods of clustering are developed for supercomputers with shared memory and hence can not be used on a grid. This paper presents a new clustering algorithm, called progressive clustering, which executes a clustering in an efficient and incremental distributed way. The data clusters resulting from this algorithm can subsequently be used in distributed data mining tasks
Keywords :
data mining; database management systems; distributed processing; grid computing; database distribution; distributed data mining; grid computing; intelligent data distribution; progressive clustering; Association rules; Clustering algorithms; Concurrent computing; Data mining; Databases; Distributed computing; Mars; Parallel processing; Supercomputers; Workstations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, 2005. ISPDC 2005. The 4th International Symposium on
Conference_Location :
Lille
Print_ISBN :
0-7695-2434-6
Type :
conf
DOI :
10.1109/ISPDC.2005.41
Filename :
1609981
Link To Document :
بازگشت