DocumentCode
3027879
Title
Improved CURE algorithm and application of clustering for large-scale data
Author
Xiufeng, Shao ; Wei, Cheng
Author_Institution
Dept. of Soft & Inf. Manage., BeiJing City Univ., Beijing, China
Volume
1
fYear
2011
fDate
9-11 Dec. 2011
Firstpage
305
Lastpage
308
Abstract
Aiming at the classification problem of large-scale document information, a large-scale data clustering algorithm based on improved CURE algorithm is proposed. By clustering the data partition and the initial class of after partition, data tracking, the large-scale data hierarchical clustering and sample classification is achieved, that better solved the balance of clustering quality and clustering effectiveness. Taking the actual document processing of Large-scale network data, the experiment results show that the algorithm is efficient.
Keywords
document handling; pattern classification; pattern clustering; CURE algorithm; classification problem; clustering effectiveness; clustering quality; data tracking; large scale data clustering; large scale document information; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Databases; Educational institutions; Partitioning algorithms; CURE algorithm; Clustering for large-scale data; Data mining;
fLanguage
English
Publisher
ieee
Conference_Titel
IT in Medicine and Education (ITME), 2011 International Symposium on
Conference_Location
Cuangzhou
Print_ISBN
978-1-61284-701-6
Type
conf
DOI
10.1109/ITiME.2011.6130839
Filename
6130839
Link To Document