DBSCALE: An efficient density-based clustering algorithm for data mining in large databases

Author

Tsai, Cheng-Fa ; Sung, Chun-Yi

Author_Institution

Dept. of Manage. Inf. Syst., Nat. Pingtung Univ. of Sci. & Technol., Pingtung, Taiwan

Volume

1

fYear

2010

fDate

1-2 Aug. 2010

Firstpage

98

Lastpage

101

Abstract

This work presents a novel clustering algorithm that incorporates neighbor searching and expansion seed selection into a density-based clustering algorithm. Data Points that have been clustered need not be input again when searching for neighborhood data points, and the algorithm redefines eight Marked Boundary Objects to add expansion seeds according to far centrifugal force, which increases coverage. Experimental results indicate that the proposed DBSCALE has a lower execution time cost than DBSCAN, mBSCAN and KIDBSCAN clustering algorithms. DBSCALE has a maximum deviation in clustering correctness rate of 0.29%, and a maximum deviation in noise data clustering rate of 0.14%.

Keywords

data mining; pattern clustering; DBSCALE; KIDBSCAN clustering algorithms; centrifugal force; clustering correctness rate; data mining; data points; efficient density based clustering algorithm; large databases; marked boundary objects; Clustering algorithms; Databases; Partitioning algorithms; data clustering; data mining; density-based clustering algorithm;

fLanguage

English

Publisher

ieee

Conference_Titel

Circuits,Communications and System (PACCS), 2010 Second Pacific-Asia Conference on

Conference_Location

Beijing

Print_ISBN

978-1-4244-7969-6

Type

conf

DOI

10.1109/PACCS.2010.5627040

Filename

5627040