Title : 
Genetically Improved PSO Algorithm for Efficient Data Clustering
         
        
            Author : 
Abdel-Kader, Rehab F.
         
        
            Author_Institution : 
Electr. Eng. Dept., Suez Canal Univ., Port-Said, Egypt
         
        
        
        
        
        
            Abstract : 
Clustering is an important research topic in data mining that appears in a wide range of unsupervised classification applications. Partitional clustering algorithms such as the k-means algorithm are the most popular for clustering large datasets. The major problem with the k-means algorithm is that it is sensitive to the selection of the initial partition and may converge to local optima. In this paper, we present a hybrid two-phase GAI-PSO+k-means data clustering algorithm that performs fast data clustering and can avoid premature convergence to local optima. The first phase uses the new genetically improved particle swarm optimization algorithm (GAI-PSO), a population-based heuristic search technique that hybridizes the cultural and social rules derived from the analysis of swarm intelligence (PSO) with the concepts of natural selection and evolution (GA). GAI-PSO combines the standard velocity and position update rules of PSO with the ideas of selection, mutation and crossover from GAs. It searches the solution space to find the optimal initial cluster centroids for the next phase. The second phase is a local refinement stage in which the k-means algorithm, started from these centroids, converges efficiently to the optimal solution. The proposed algorithm combines the global search ability of evolutionary algorithms with the fast convergence of the k-means algorithm and avoids the drawbacks of both. Its performance is evaluated on several benchmark datasets. The experimental results show that the proposed algorithm is highly effective and outperforms previous approaches such as SA, ACO, PSO and k-means on the partitional clustering problem.
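
The abstract gives no update equations or parameter values, so the following is only a minimal sketch of the two-phase idea, not the paper's implementation. It assumes the textbook PSO velocity and position updates, a simple GA step (the worse half of the swarm is replaced by one-point crossover children of the better half, with Gaussian mutation), and the sum of squared errors as fitness. All function names and defaults (`gai_pso`, `w`, `c1`, `c2`, `p_mut`, the mutation scale 0.1) are illustrative assumptions.

```python
import random

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def sse(points, centroids):
    """Fitness: sum of squared distances to the nearest centroid."""
    return sum(min(sq_dist(p, c) for c in centroids) for p in points)

def kmeans(points, centroids, iters=50):
    """Phase 2: plain Lloyd iterations from the given starting centroids."""
    for _ in range(iters):
        groups = [[] for _ in centroids]
        for p in points:
            d = [sq_dist(p, c) for c in centroids]
            groups[d.index(min(d))].append(p)
        new = [tuple(sum(col) / len(g) for col in zip(*g)) if g else c
               for g, c in zip(groups, centroids)]
        if new == centroids:
            break
        centroids = new
    return centroids

def gai_pso(points, k, n_particles=20, iters=40, w=0.7, c1=1.5, c2=1.5,
            p_mut=0.1, seed=0):
    """Phase 1 (assumed form): PSO updates plus a GA step per iteration."""
    rng = random.Random(seed)
    dim, n = len(points[0]), k * len(points[0])

    def decode(x):  # flat particle -> list of k centroids
        return [tuple(x[i * dim:(i + 1) * dim]) for i in range(k)]

    def fit(x):
        return sse(points, decode(x))

    # Each particle starts as k randomly chosen data points.
    pos = [[v for p in rng.sample(points, k) for v in p]
           for _ in range(n_particles)]
    vel = [[0.0] * n for _ in range(n_particles)]
    pbest = [x[:] for x in pos]
    pbest_f = [fit(x) for x in pos]
    g = min(range(n_particles), key=pbest_f.__getitem__)
    gbest, gbest_f = pbest[g][:], pbest_f[g]

    for _ in range(iters):
        for i in range(n_particles):
            for j in range(n):  # standard PSO velocity/position update
                vel[i][j] = (w * vel[i][j]
                             + c1 * rng.random() * (pbest[i][j] - pos[i][j])
                             + c2 * rng.random() * (gbest[j] - pos[i][j]))
                pos[i][j] += vel[i][j]
            f = fit(pos[i])
            if f < pbest_f[i]:
                pbest[i], pbest_f[i] = pos[i][:], f
            if f < gbest_f:
                gbest, gbest_f = pos[i][:], f
        # GA step (assumption): selection, one-point crossover, mutation.
        order = sorted(range(n_particles), key=pbest_f.__getitem__)
        half = n_particles // 2
        for worst in order[half:]:
            a, b = rng.sample(order[:half], 2)
            cut = rng.randrange(1, n)
            child = pbest[a][:cut] + pbest[b][cut:]
            child = [v + rng.gauss(0, 0.1) if rng.random() < p_mut else v
                     for v in child]
            pos[worst], vel[worst] = child, [0.0] * n
            pbest[worst], pbest_f[worst] = child[:], fit(child)
            if pbest_f[worst] < gbest_f:
                gbest, gbest_f = child[:], pbest_f[worst]
    return decode(gbest)

def gai_pso_kmeans(points, k, **kwargs):
    """Hybrid: global search for initial centroids, then k-means refinement."""
    return kmeans(points, gai_pso(points, k, **kwargs))
```

Calling `gai_pso_kmeans(points, k)` with `points` as a list of equal-length tuples returns `k` centroid tuples. The sketch needs `k * dim >= 2` and at least four particles, and the fixed mutation scale assumes features of roughly unit range, so real data would need scaling first.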
         
        
            Keywords : 
convergence; data mining; genetic algorithms; particle swarm optimisation; pattern classification; pattern clustering; search problems; PSO; convergence; data clustering; data mining; evolutionary algorithm; genetic algorithm; genetically improved particle swarm optimization algorithm; heuristic search technique; k-means algorithm; partitional clustering algorithm; position update rules; standard velocity rules; unsupervised classification; Clustering algorithms; Data mining; Evolutionary computation; Genetic algorithms; Genetic mutations; Iterative algorithms; Machine learning; Machine learning algorithms; Particle swarm optimization; Partitioning algorithms; Data Clustering; Genetic Algorithm; Hybrid Evolutionary Algorithm; K-means Clustering; Particle Swarm Optimization;
         
        
        
        
            Conference_Title : 
Machine Learning and Computing (ICMLC), 2010 Second International Conference on
         
        
            Conference_Location : 
Bangalore
         
        
            Print_ISBN : 
978-1-4244-6006-9
         
        
            Electronic_ISBN : 
978-1-4244-6007-6
         
        
        
            DOI : 
10.1109/ICMLC.2010.19