DocumentCode
618298
Title
Review based on data clustering algorithms
Author
Nagpal, Arpita ; Jatain, Aman ; Gaur, Deepti
fYear
2013
fDate
11-12 April 2013
Firstpage
298
Lastpage
303
Abstract
A review based on different types of clustering algorithms with their corresponding data sets has been proposed. In this paper, we have given a complete comparative statistical analysis of various clustering algorithms. Clustering algorithms usually employ distance metric or similarity metric to cluster the data set into different partitions. Well known clustering algorithms have been widely used in various disciplines. Type of clustering algorithm used depends upon the application and data set used in that field. Numerical data set is comparatively easy to implement as data are invariably real number and can be used for statistical applications. Others type of data set such as categorical, time series, boolean, and spatial, temporal have limited applications. By viewing the statistical analysis, it is observed that there is no optimal solution for handling problems with large data sets of mixed and categorical attributes. Some of the algorithms can be applied but their performance degrades as the size of data keeps on increasing.
Keywords
data handling; pattern clustering; statistical analysis; comparative statistical analysis; data clustering algorithms; data set cluster; optimal solution; similarity metric; statistical application; Algorithm design and analysis; Clustering algorithms; Clustering methods; Communications technology; Conferences; Couplings; Partitioning algorithms; Data Clustering; Special; Statistical analysis; hierarchical clustering; temporal;
fLanguage
English
Publisher
ieee
Conference_Titel
Information & Communication Technologies (ICT), 2013 IEEE Conference on
Conference_Location
JeJu Island
Print_ISBN
978-1-4673-5759-3
Type
conf
DOI
10.1109/CICT.2013.6558109
Filename
6558109
Link To Document