Title :
DBSCALE: An efficient density-based clustering algorithm for data mining in large databases
Author :
Tsai, Cheng-Fa ; Sung, Chun-Yi
Author_Institution :
Dept. of Manage. Inf. Syst., Nat. Pingtung Univ. of Sci. & Technol., Pingtung, Taiwan
Abstract :
This work presents a novel clustering algorithm that incorporates neighbor searching and expansion seed selection into a density-based clustering algorithm. Data Points that have been clustered need not be input again when searching for neighborhood data points, and the algorithm redefines eight Marked Boundary Objects to add expansion seeds according to far centrifugal force, which increases coverage. Experimental results indicate that the proposed DBSCALE has a lower execution time cost than DBSCAN, mBSCAN and KIDBSCAN clustering algorithms. DBSCALE has a maximum deviation in clustering correctness rate of 0.29%, and a maximum deviation in noise data clustering rate of 0.14%.
Keywords :
data mining; pattern clustering; DBSCALE; KIDBSCAN clustering algorithms; centrifugal force; clustering correctness rate; data mining; data points; efficient density based clustering algorithm; large databases; marked boundary objects; Clustering algorithms; Databases; Partitioning algorithms; data clustering; data mining; density-based clustering algorithm;
Conference_Titel :
Circuits,Communications and System (PACCS), 2010 Second Pacific-Asia Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7969-6
DOI :
10.1109/PACCS.2010.5627040