Title :
Review based on data clustering algorithms
Author :
Nagpal, Arpita ; Jatain, Aman ; Gaur, Deepti
Abstract :
A review based on different types of clustering algorithms with their corresponding data sets has been proposed. In this paper, we have given a complete comparative statistical analysis of various clustering algorithms. Clustering algorithms usually employ distance metric or similarity metric to cluster the data set into different partitions. Well known clustering algorithms have been widely used in various disciplines. Type of clustering algorithm used depends upon the application and data set used in that field. Numerical data set is comparatively easy to implement as data are invariably real number and can be used for statistical applications. Others type of data set such as categorical, time series, boolean, and spatial, temporal have limited applications. By viewing the statistical analysis, it is observed that there is no optimal solution for handling problems with large data sets of mixed and categorical attributes. Some of the algorithms can be applied but their performance degrades as the size of data keeps on increasing.
Keywords :
data handling; pattern clustering; statistical analysis; comparative statistical analysis; data clustering algorithms; data set cluster; optimal solution; similarity metric; statistical application; Algorithm design and analysis; Clustering algorithms; Clustering methods; Communications technology; Conferences; Couplings; Partitioning algorithms; Data Clustering; Special; Statistical analysis; hierarchical clustering; temporal;
Conference_Titel :
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location :
JeJu Island
Print_ISBN :
978-1-4673-5759-3
DOI :
10.1109/CICT.2013.6558109