DocumentCode :
3027879
Title :
Improved CURE algorithm and application of clustering for large-scale data
Author :
Xiufeng, Shao ; Wei, Cheng
Author_Institution :
Dept. of Soft & Inf. Manage., BeiJing City Univ., Beijing, China
Volume :
1
fYear :
2011
fDate :
9-11 Dec. 2011
Firstpage :
305
Lastpage :
308
Abstract :
Aiming at the classification problem of large-scale document information, a large-scale data clustering algorithm based on improved CURE algorithm is proposed. By clustering the data partition and the initial class of after partition, data tracking, the large-scale data hierarchical clustering and sample classification is achieved, that better solved the balance of clustering quality and clustering effectiveness. Taking the actual document processing of Large-scale network data, the experiment results show that the algorithm is efficient.
Keywords :
document handling; pattern classification; pattern clustering; CURE algorithm; classification problem; clustering effectiveness; clustering quality; data tracking; large scale data clustering; large scale document information; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Databases; Educational institutions; Partitioning algorithms; CURE algorithm; Clustering for large-scale data; Data mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
IT in Medicine and Education (ITME), 2011 International Symposium on
Conference_Location :
Cuangzhou
Print_ISBN :
978-1-61284-701-6
Type :
conf
DOI :
10.1109/ITiME.2011.6130839
Filename :
6130839
Link To Document :
بازگشت