Title :
Clustering of large data based on the relational analysis
Author :
Slaoui, Said Chah ; Lamari, Yasmine
Author_Institution :
Comput. Sci. Res. Lab., Mohammed V Univ., Rabat, Morocco
Abstract :
This paper presents a fast heuristic which finds clusters by partitioning categorical large data sets according to the Relational Analysis, whereby the cluster analysis is modeled as a linear integer program with n2 attributes (n is the number of observations) and solved by the optimization under constraints of the Condorcet criterion. Without neither a sampling method nor the fixing of input parameters and while using a natural cluster structure, Transitive heuristic needs a small amount of memory and a short time to provide good quality partition. Experimental results on real and synthetic data sets are presented in order to show that clusters, formed using this technique, are intensive and accurate.
Keywords :
integer programming; linear programming; pattern clustering; statistical analysis; Condorcet criterion; categorical large data sets; cluster analysis; large data clustering; linear integer program; natural cluster structure; relational analysis; Algorithm design and analysis; Clustering algorithms; Generators; Heuristic algorithms; Linear programming; Partitioning algorithms; Sampling methods; Categorical data; Cluster analysis; Condorcet criterion; Partitioning heuristic; Relational Analysis;
Conference_Titel :
Intelligent Systems and Computer Vision (ISCV), 2015
Conference_Location :
Fez
Print_ISBN :
978-1-4799-7510-5
DOI :
10.1109/ISACV.2015.7105550