Title :
Study of data ming classification based on genetic algorithm
Author :
Li, Xiaofeng ; Xin, Chan ; Yang, Li Li
Author_Institution :
Dept. of Comput. Appl. Technol., Technol. of Harbin Inst. of Technol., Harbin, China
Abstract :
In view of genetic superiority in data mining algorithms, this paper combines the genetic algorithm and K-means algorithm and presents a genetic algorithm based k-means clustering algorithm and the algorithm to improve genetic clustering algorithm clustering using variable length actual real number of cluster center, and design a new crossover and mutation operators and the introduction of is widely used cluster validity index DB-Index as the target function, it not only better solve the K-means clustering algorithm, the number of clusters is difficult to determine the initial value of sensitivity and defects such as easy to fall into local optimum, and the algorithm efficiency and accuracy of the algorithm are greatly improved and compared with previous algorithms.
Keywords :
data mining; genetic algorithms; pattern classification; pattern clustering; DB-Index; data mining algorithm; data mining classification; genetic clustering algorithm; k-means clustering algorithm; Educational institutions; clustering; data mining; genetic algorithm;
Conference_Titel :
Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6539-2
DOI :
10.1109/ICACTE.2010.5579835