مرکز منطقه ای اطلاع رساني علوم و فناوري - A Fast Density-Based Clustering Algorithm for Large Databases

DocumentCode :

2888493

Title :

A Fast Density-Based Clustering Algorithm for Large Databases

Author :

Liu, Bing

Author_Institution :

China Securities, Beijing

fYear :

2006

fDate :

13-16 Aug. 2006

Firstpage :

996

Lastpage :

1000

Abstract :

DBSCAN is a typical clustering algorithm, which can discover clusters with any arbitrary shape and handle noise well. However, it is also slow in comparison due to neighborhood query for each object and faces difficulty in setting density threshold properly. In this paper, a fast density-based clustering algorithm is presented based on DBSCAN. After sorting objects by a certain dimensional coordinates, the new algorithm selects orderly unlabelled points outside a core object´s neighborhood as seeds to expand clusters so that the execution frequency of region queries can be decreased. Objects are transformed with a kernel function to improve the clustering accuracy, which diminishes the dependency of density threshold to some extent. Theoretic analysis indicates that the time complexity of this algorithm is approximately linear. Experiments show that the efficiency and the quality for clusters of the proposed algorithm are remarkably superior to those of DBSCAN

Keywords :

computational complexity; data mining; pattern clustering; query processing; very large databases; DBSCAN; density-based clustering algorithm; kernel function; large databases; linear approximation; time complexity; Algorithm design and analysis; Clustering algorithms; Cybernetics; Data security; Databases; Frequency; Kernel; Linear approximation; Machine learning; Machine learning algorithms; Noise shaping; Partitioning algorithms; Shape; Sorting; Spatial databases; Clustering; DBSCAN; Kernel transformation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Cybernetics, 2006 International Conference on

Conference_Location :

Dalian, China

Print_ISBN :

1-4244-0061-9

Type :

conf

DOI :

10.1109/ICMLC.2006.258531

Filename :

4028209

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2888493