• DocumentCode
    3027879
  • Title

    Improved CURE algorithm and application of clustering for large-scale data

  • Author

    Xiufeng, Shao ; Wei, Cheng

  • Author_Institution
    Dept. of Soft & Inf. Manage., BeiJing City Univ., Beijing, China
  • Volume
    1
  • fYear
    2011
  • fDate
    9-11 Dec. 2011
  • Firstpage
    305
  • Lastpage
    308
  • Abstract
    Aiming at the classification problem of large-scale document information, a large-scale data clustering algorithm based on improved CURE algorithm is proposed. By clustering the data partition and the initial class of after partition, data tracking, the large-scale data hierarchical clustering and sample classification is achieved, that better solved the balance of clustering quality and clustering effectiveness. Taking the actual document processing of Large-scale network data, the experiment results show that the algorithm is efficient.
  • Keywords
    document handling; pattern classification; pattern clustering; CURE algorithm; classification problem; clustering effectiveness; clustering quality; data tracking; large scale data clustering; large scale document information; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Databases; Educational institutions; Partitioning algorithms; CURE algorithm; Clustering for large-scale data; Data mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    IT in Medicine and Education (ITME), 2011 International Symposium on
  • Conference_Location
    Cuangzhou
  • Print_ISBN
    978-1-61284-701-6
  • Type

    conf

  • DOI
    10.1109/ITiME.2011.6130839
  • Filename
    6130839