Title :
An improved sampling-based DBSCAN for large spatial databases
Author :
Borah, B. ; Bhattacharyya, D.K.
Author_Institution :
Dept. of Inf. Technol., Tezpur Univ., Assam, India
Abstract :
Spatial data clustering is one of the important data mining techniques for extracting knowledge from the large amounts of spatial data collected in various applications, such as remote sensing, GIS, computer cartography, and environmental assessment and planning. Several useful and popular spatial data clustering algorithms have been proposed in the past decade. DBSCAN is one of them; it can discover clusters of arbitrary shape and handle noise points effectively. However, DBSCAN requires a large amount of memory because it operates on the entire database. This paper presents an improved sampling-based DBSCAN that can cluster large-scale spatial databases effectively. Experimental results are included to establish that the proposed sampling-based DBSCAN outperforms DBSCAN, as well as its other counterparts, in execution time without losing clustering quality.
Keywords :
data mining; pattern clustering; sampling methods; very large databases; visual databases; data mining techniques; density based spatial clustering of applications with noise; image sampling; knowledge extraction; large spatial databases; spatial data clustering algorithms; Application software; Clustering algorithms; Data mining; Geographic Information Systems; Large-scale systems; Noise shaping; Remote sensing; Shape; Spatial databases; Working environment noise
Conference_Title :
Intelligent Sensing and Information Processing, 2004. Proceedings of International Conference on
Print_ISBN :
0-7803-8243-9
DOI :
10.1109/ICISIP.2004.1287631