• DocumentCode
    1593982
  • Title

    Accelerating K-Means on the Graphics Processor via CUDA

  • Author

    Zechner, Mario ; Granitzer, Michael

  • Author_Institution
    Know-Center, Graz
  • fYear
    2009
  • Firstpage
    7
  • Lastpage
    15
  • Abstract
    In this paper an optimized k-means implementation on the graphics processing unit (GPU) is presented. NVIDIApsilas compute unified device architecture (CUDA), available from the G80 GPU family onwards, is used as the programming environment. Emphasis is placed on optimizations directly targeted at this architecture to best exploit the computational capabilities available. Additionally drawbacks and limitations of previous related work, e.g. maximum instance, dimension and centroid count are addressed. The algorithm is realized in a hybrid manner, parallelizing distance calculations on the GPU while sequentially updating cluster centroids on the CPU based on the results from the GPU calculations. An empirical performance study on synthetic data is given, demonstrating a maximum 14times speed increase to a fully SIMD optimized CPU implementation.
  • Keywords
    computer graphic equipment; optimisation; parallel architectures; NVIDIA compute unified device architecture; SIMD optimized CPU implementation; distance calculations; graphics processing unit; graphics processor; optimized k-means implementation; Acceleration; Central Processing Unit; Clustering algorithms; Computer architecture; Data mining; Graphics; Hardware; Optimization methods; Pipelines; Programming environments;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intensive Applications and Services, 2009. INTENSIVE '09. First International Conference on
  • Conference_Location
    Valencia
  • Print_ISBN
    978-1-4244-3683-5
  • Electronic_ISBN
    978-0-7695-3585-2
  • Type

    conf

  • DOI
    10.1109/INTENSIVE.2009.19
  • Filename
    4976415