• DocumentCode
    2106005
  • Title

    Parallel K-PSO based on MapReduce

  • Author

    Junjun Wang ; Dongfeng Yuan ; Mingyan Jiang

  • Author_Institution
    Sch. of Inf. Sci. & Eng., Shandong Univ., Jinan, China
  • fYear
    2012
  • fDate
    9-11 Nov. 2012
  • Firstpage
    1203
  • Lastpage
    1208
  • Abstract
    K-means is widely used in scientific research and commercial applications because of its simplicity and linearity. However, in faced of ever-growing amount of data and higher demand of cluster analysis today, how to improve the performance of K-means has become challenging and significant. So an improved method called parallel K-PSO which combines Particle Swarm Optimization (PSO) with K-means based on MapReduce is proposed in this paper. Firstly, it takes advantage of PSO to improve the global search ability of K-means, and then it makes K-means parallel with MapReduce to enhance its capability of processing massive data. We evaluate the proposed method through experimental results.
  • Keywords
    data mining; parallel processing; particle swarm optimisation; pattern clustering; K-means; MapReduce; cluster analysis; data processing; parallel K-PSO; particle swarm optimization; Hadoop; K-means; MapReduce; PSO;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Technology (ICCT), 2012 IEEE 14th International Conference on
  • Conference_Location
    Chengdu
  • Print_ISBN
    978-1-4673-2100-6
  • Type

    conf

  • DOI
    10.1109/ICCT.2012.6511380
  • Filename
    6511380