• DocumentCode
    2936649
  • Title

    DBSCALE: An efficient density-based clustering algorithm for data mining in large databases

  • Author

    Tsai, Cheng-Fa ; Sung, Chun-Yi

  • Author_Institution
    Dept. of Manage. Inf. Syst., Nat. Pingtung Univ. of Sci. & Technol., Pingtung, Taiwan
  • Volume
    1
  • fYear
    2010
  • fDate
    1-2 Aug. 2010
  • Firstpage
    98
  • Lastpage
    101
  • Abstract
    This work presents a novel clustering algorithm that incorporates neighbor searching and expansion seed selection into a density-based clustering algorithm. Data Points that have been clustered need not be input again when searching for neighborhood data points, and the algorithm redefines eight Marked Boundary Objects to add expansion seeds according to far centrifugal force, which increases coverage. Experimental results indicate that the proposed DBSCALE has a lower execution time cost than DBSCAN, mBSCAN and KIDBSCAN clustering algorithms. DBSCALE has a maximum deviation in clustering correctness rate of 0.29%, and a maximum deviation in noise data clustering rate of 0.14%.
  • Keywords
    data mining; pattern clustering; DBSCALE; KIDBSCAN clustering algorithms; centrifugal force; clustering correctness rate; data mining; data points; efficient density based clustering algorithm; large databases; marked boundary objects; Clustering algorithms; Databases; Partitioning algorithms; data clustering; data mining; density-based clustering algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits,Communications and System (PACCS), 2010 Second Pacific-Asia Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-7969-6
  • Type

    conf

  • DOI
    10.1109/PACCS.2010.5627040
  • Filename
    5627040