DocumentCode :
708633
Title :
Clustering of large data based on the relational analysis
Author :
Slaoui, Said Chah ; Lamari, Yasmine
Author_Institution :
Comput. Sci. Res. Lab., Mohammed V Univ., Rabat, Morocco
fYear :
2015
fDate :
25-26 March 2015
Firstpage :
1
Lastpage :
7
Abstract :
This paper presents a fast heuristic which finds clusters by partitioning categorical large data sets according to the Relational Analysis, whereby the cluster analysis is modeled as a linear integer program with n2 attributes (n is the number of observations) and solved by the optimization under constraints of the Condorcet criterion. Without neither a sampling method nor the fixing of input parameters and while using a natural cluster structure, Transitive heuristic needs a small amount of memory and a short time to provide good quality partition. Experimental results on real and synthetic data sets are presented in order to show that clusters, formed using this technique, are intensive and accurate.
Keywords :
integer programming; linear programming; pattern clustering; statistical analysis; Condorcet criterion; categorical large data sets; cluster analysis; large data clustering; linear integer program; natural cluster structure; relational analysis; Algorithm design and analysis; Clustering algorithms; Generators; Heuristic algorithms; Linear programming; Partitioning algorithms; Sampling methods; Categorical data; Cluster analysis; Condorcet criterion; Partitioning heuristic; Relational Analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Computer Vision (ISCV), 2015
Conference_Location :
Fez
Print_ISBN :
978-1-4799-7510-5
Type :
conf
DOI :
10.1109/ISACV.2015.7105550
Filename :
7105550
Link To Document :
بازگشت