DocumentCode :
602531
Title :
Identification of non-disjoint clusters with small and parameterizable overlaps
Author :
Ben N´Cir, Chiheb-Eddine ; Cleuziou, G. ; Essoussi, Nadia
fYear :
2013
fDate :
20-22 Jan. 2013
Firstpage :
1
Lastpage :
6
Abstract :
Identification of non-disjoint groups in unlabeled data sets is an important issue in clustering. Many real life applications require to find overlapping clusters in order to fit the data set structures such as clustering of films where each film can have different genres. This paper presents an overlapping k-means method refereed as Restricted-OKM (Restricted Overlapping k-means) that generalizes the well known k-means algorithm to detect overlapping clusters. The proposed method produces restricted overlapping boundaries between clusters and improves clustering accuracy to make the method adapted for clustering data with small overlaps. The proposed method is extended to control sizes of overlaps between clusters with respect to user expectations. Experiments, performed on overlapping data sets, show that proposed methods outperform OKM (Overlapping k-means) and fuzzy c-means in terms of clustering accuracy and produce clusters with small overlapping boundaries.
Keywords :
fuzzy set theory; identification; pattern clustering; data clustering; data set structure; film clustering; fuzzy c-means; identification; k-means algorithm; nondisjoint cluster; nondisjoint group; overlapping cluster; overlapping k-means method; parameterizable overlap; restricted overlapping boundary; restricted overlapping k-means; restricted-OKM; unlabeled data set; user expectation; Clustering algorithms; Clustering methods; Equations; Linear programming; Minimization; Prototypes; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Applications Technology (ICCAT), 2013 International Conference on
Conference_Location :
Sousse
Print_ISBN :
978-1-4673-5284-0
Type :
conf
DOI :
10.1109/ICCAT.2013.6522010
Filename :
6522010
Link To Document :
بازگشت