DocumentCode :
2506878
Title :
Using grid for accelerating density-based clustering
Author :
Mahran, Shaaban ; Mahar, K.
Author_Institution :
Arab Acad. for Sci. & Technol., Alexandria
fYear :
2008
fDate :
8-11 July 2008
Firstpage :
35
Lastpage :
40
Abstract :
Clustering analysis is a primary method for data mining. The ever increasing volumes of data in different applications forces clustering algorithms to cope with it. DBSCAN is a well-known algorithm for density-based clustering. It is both effective so it can detect arbitrary shaped clusters of dense regions and efficient especially in existence of spatial indexes to perform the neighborhood queries efficiently. In this paper we introduce a new algorithm GriDBSCAN to enhance the performance of DBSCAN using grid partitioning and merging, yielding a high performance with the advantage of high degree of parallelism. We verified the correctness of the algorithm theoretically and experimentally, studied the performance theoretically and using experiments on both real and synthetic data. It proved to run much faster than original DBSCAN. We compared the algorithm with a similar algorithm, EnhancedDBSCAN, which is also an enhancement to DBSCAN using partitioning. Experiments showed the new algorithm´s superiority in performance and degree of parallelism.
Keywords :
data analysis; data mining; grid computing; pattern clustering; GriDBSCAN; data mining; density-based clustering; grid merging; grid partitioning; Acceleration; Clustering algorithms; Data mining; Databases; Gravity; Merging; Parallel processing; Partitioning algorithms; Shape; Spatial indexes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology, 2008. CIT 2008. 8th IEEE International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-2357-6
Electronic_ISBN :
978-1-4244-2358-3
Type :
conf
DOI :
10.1109/CIT.2008.4594646
Filename :
4594646
Link To Document :
بازگشت