• DocumentCode
    2637287
  • Title

    A distribution-based clustering algorithm for mining in large spatial databases

  • Author

    Xu, Xiaowei ; Ester, Martin ; Kriegel, Hans-Peter ; Sander, Jörg

  • Author_Institution
    Munich Univ., Germany
  • fYear
    1998
  • fDate
    23-27 Feb 1998
  • Firstpage
    324
  • Lastpage
    331
  • Abstract
    The problem of detecting clusters of points belonging to a spatial point process arises in many applications. In this paper, we introduce the new clustering algorithm DBCLASD (Distribution-Based Clustering of LArge Spatial Databases) to discover clusters of this type. The results of experiments demonstrate that DBCLASD, contrary to partitioning algorithms such as CLARANS (Clustering Large Applications based on RANdomized Search), discovers clusters of arbitrary shape. Furthermore, DBCLASD does not require any input parameters, in contrast to the clustering algorithm DBSCAN (Density-Based Spatial Clustering of Applications with Noise) requiring two input parameters, which may be difficult to provide for large databases. In terms of efficiency, DBCLASD is between CLARANS and DBSCAN, close to DBSCAN. Thus, the efficiency of DBCLASD on large spatial databases is very attractive when considering its nonparametric nature and its good quality for clusters of arbitrary shape
  • Keywords
    data analysis; deductive databases; knowledge acquisition; very large databases; visual databases; CLARANS; DBCLASD; DBSCAN; arbitrary shaped clusters; data mining; density-based spatial clustering; distribution-based clustering algorithm; efficiency; input parameters; large spatial databases; noise; nonparametric nature; partitioning algorithms; point cluster detection; quality; randomized search; spatial point process; Clustering algorithms; Clustering methods; Electrical capacitance tomography; Gravity; Iterative algorithms; Nearest neighbor searches; Noise shaping; Partitioning algorithms; Shape; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 1998. Proceedings., 14th International Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1063-6382
  • Print_ISBN
    0-8186-8289-2
  • Type

    conf

  • DOI
    10.1109/ICDE.1998.655795
  • Filename
    655795