Title :
KNN-kernel based clustering for spatio-temporal database
Author :
Musdholifah, Aina ; Hashim, Siti Zaiton BT Mohd ; Wasito, Ito
Author_Institution :
Fac. of Comput. Sci. & Inf. Syst., Univ. Teknol. Malaysia, Skudai, Malaysia
Abstract :
Extracting and analyzing the interesting patterns from spatio-temporal databases, have drawn a great interest in various fields of research. Recently, a number of experiments have explored the problem of spatial or temporal data mining, and some clustering algorithms have been proposed. However, not many studies have been dealing with the integration of spatial data mining and temporal data mining. Moreover, the data in spatial temporal database can be categorized as high-dimensional data. Current density-based clustering might have difficulties with complex data sets including high-dimensional data. This paper presents Iterative Local Gaussian Clustering (ILGC), an algorithm that combines K-nearest neighbour (KNN) density estimation and Kernel density estimation, to cluster the spatiotemporal data. In this approach, the KNN density estimation is extended and combined with Kernel function, where KNN contributes in determining the best local data iteratively for kernel density estimation. The local best is defined as the set of neighbour data that maximizes the kernel function. Bayesian rule is used to deal with the problem of selecting the best local data. This paper utilized Gaussian kernel which has been proven successful in the clustering. To validate the KNN-kernel based algorithm, we compare its performance againts other popular algorithms, such as Self Organizing Maps (SOM) and K-Means, on Crime database. Results show that KNN-kernel based clustering has outperformed others.
Keywords :
Bayes methods; Gaussian processes; data mining; iterative methods; pattern clustering; self-organising feature maps; temporal databases; visual databases; Bayesian rule; Gaussian kernel; ILGC; KNN density estimation; KNN-kernel based clustering; SOM; clustering algorithms; crime database; current density-based clustering; interesting patterns; iterative local Gaussian clustering; k-means; k-nearest neighbour density estimation; kernel density estimation; self organizing maps; spatial data mining; spatio-temporal database; temporal data mining; Bayesian methods; Clustering algorithms; Data mining; Estimation; Indexes; Kernel; Spatial databases; Bayesian rule; Gaussian Kernel; Iterative Local Gaussian Clustering; KNN; Kernel clustering; Spatio-temporal database;
Conference_Titel :
Computer and Communication Engineering (ICCCE), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-6233-9
DOI :
10.1109/ICCCE.2010.5556805