• DocumentCode
    2545936
  • Title

    Clustering Algorithm on Block Division of Documents

  • Author

    Liu, Gang ; Luo, Mingyue

  • Author_Institution
    Sch. of Electron. & Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2010
  • fDate
    23-25 Sept. 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In the traditional K-means algorithm, the selection of cluster number and the initial cluster center brings huge affection on the quality of clustering. To reduce the dependence on the initial center and to locate the types of new data rapidly, an algorithm applicable for text data is proposed. In this algorithm, document density is considered as parameter. Documents are divided into blocks first. After that every divided block is clustered separately. Experiment shows that this algorithm not only makes higher quality for clustering, but also does well in the new increasing data.
  • Keywords
    document handling; pattern clustering; K-means algorithm; clustering algorithm; clustering quality; document block division; document density; Algorithm design and analysis; Clustering algorithms; Computational modeling; Fluctuations; Internet; Partitioning algorithms; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications Networking and Mobile Computing (WiCOM), 2010 6th International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4244-3708-5
  • Electronic_ISBN
    978-1-4244-3709-2
  • Type

    conf

  • DOI
    10.1109/WICOM.2010.5600166
  • Filename
    5600166