Author/Authors :
ALZAND, Hüssein Ridha Ali Gazi Üniversitesi - Bilişim Enstitüsü - Bilgisayar Bilimleri Anabilim Dalı, Turkey , KARACAN, Hacer Gazi Üniversitesi - Mühendislik Fakültesi - Bilgisayar Mühendisliği Bölümü, Turkey
Title Of Article :
Comparison of partitioning-based clustering algorithms on differently distributed data
شماره ركورد :
28223
Abstract :
As a result of widespread use of technology, large volumes of collected data began to emerge. It is impossible to discover and analyze any information in large data like this, so in this case data mining comes into play. Data mining is a process that discovers unpredictable and usable knowledge from databases. In other words, data mining is defined as the process of finding relation patterns, changes, deviations and trends, as well as interesting information specific structures from large databases. One of the widely used data mining methods is a method of clustering. Clustering divides the data set into different clusters, and it tries to make the likelihood ratio as minimum inside the cluster and as maximum among other clusters depending on the options in the database. In this study, partitioning-based clustering methods are discussed by applying them on data sets with different distribution patterns. We used k-means and kernel k-means partitioning algorithms for clustering data sets. By applying clustering operations on differently distributed data sets, we compared the speed, clustering quality and the size of memory usage for these algorithms. The information that we gathered by this comparison is presented and discussed in the related sections of this paper.
From Page :
56
NaturalLanguageKeyword :
clustering algorithms , clustering analysis.
JournalTitle :
Erciyes University Journal Of The Institute Of Science an‎d Technology
To Page :
62
Link To Document :
بازگشت