• DocumentCode
    632435
  • Title

    Robust Local Triangular Kernel density-based clustering for high-dimensional data

  • Author

    Musdholifah, Aina ; Hashim, Siti Z. M. ; Ngah, Razali

  • Author_Institution
    Dept. of Comput. Sci. & Electron., Univ. Gadjah Mada (UGM), Yogyakarta, Indonesia
  • fYear
    2013
  • fDate
    27-28 March 2013
  • Firstpage
    24
  • Lastpage
    32
  • Abstract
    A number of clustering algorithms can be employed to find clusters in multivariate data. However, the effectiveness and efficiency of the existing algorithms are limited, since the respective data has high dimension, contain large amount of noise and consist of clusters with arbitrary shapes and densities. In this paper, a new kernel density-based clustering algorithm, called Local Triangular Kernel-based Clustering (LTKC), is proposed to deal with these conditions. LTKC is based on combination of k-nearest-neighbor density estimation and triangular kernel density-based clustering. The advantages of our LTKC approach are: (1) it has a firm mathematical basis; (2) it requires only one parameter, number of neighbors; (3) it defines the number of cluster automatically; (4) it allows discovering clusters with arbitrary shapes and densities ;and (5) it is significantly faster than existing algorithms. LTKC is tested using artificial data and applied to some UCI data. A comparison with k-means, KFCM and well known density-based clustering algorithms including ILGC, DBSCAN, and DENCLUE shows the superiority of our proposed LTKC algorithm.
  • Keywords
    pattern clustering; DBSCAN; DENCLUE; ILGC; KFCM; LTKC; UCI data; artificial data; cluster discovery; high-dimensional data; k-means clustering algorithms; k-nearest-neighbor density estimation; multivariate data; robust local triangular kernel density-based clustering; Accuracy; Classification algorithms; Clustering algorithms; Density functional theory; Estimation; Kernel; Shape; Classification; Clustering; density-based clustering; high-dimensional data; k-nearest-neighbor density estimation; kernel density estimation; multivariate data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (CSIT), 2013 5th International Conference on
  • Conference_Location
    Amman
  • Type

    conf

  • DOI
    10.1109/CSIT.2013.6588753
  • Filename
    6588753