• DocumentCode
    2994478
  • Title

    A Parallel K-Means Clustering Algorithm with MPI

  • Author

    Zhang, Jing ; Wu, Gongqing ; Hu, Xuegang ; Li, Shiying ; Hao, Shuilong

  • Author_Institution
    Sch. of Comput. Sci. & Inf. Eng., Hefei Univ. of Technol., Hefei, China
  • fYear
    2011
  • fDate
    9-11 Dec. 2011
  • Firstpage
    60
  • Lastpage
    64
  • Abstract
    Clustering is one of the most popular methods for data analysis, which is prevalent in many disciplines such as image segmentation, bioinformatics, pattern recognition and statistics etc. The most popular and simplest clustering algorithm is K-means because of its easy implementation, simplicity, efficiency and empirical success. However, the real-world applications produce huge volumes of data, thus, how to efficiently handle of these data in an important mining task has been a challenging and significant issue. In addition, MPI (Message Passing Interface) as a programming model of message passing presents high performances, scalability and portability. Motivated by this, a parallel K-means clustering algorithm with MPI, called MKmeans, is proposed in this paper. The algorithm enables applying the clustering algorithm effectively in the parallel environment. Experimental study demonstrates that MKmeans is relatively stable and portable, and it performs with low overhead of time on large volumes of data sets.
  • Keywords
    data analysis; message passing; pattern clustering; MPI; bioinformatics; data analysis; image segmentation; message passing interface; parallel k-means clustering algorithm; pattern recognition; statistical analysis; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Message passing; Parallel processing; Partitioning algorithms; K-means algorithm; MPI; clustering; parallel computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Architectures, Algorithms and Programming (PAAP), 2011 Fourth International Symposium on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-1-4577-1808-3
  • Type

    conf

  • DOI
    10.1109/PAAP.2011.17
  • Filename
    6128477