• DocumentCode
    2190363
  • Title

    Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster

  • Author

    Wang Lei ; Zhang Yunquan ; Zhang Xianyi ; Liu FangFang

  • Author_Institution
    Lab. of Parallel Comput., Chinese Acad. of Sci., Beijing, China
  • fYear
    2010
  • fDate
    June 29 2010-July 1 2010
  • Firstpage
    1169
  • Lastpage
    1174
  • Abstract
    In this paper, the mixed precision algorithm to solve the linear system of equations and the implementation of HPL package are introduced. We use this mixed precision algorithm to improve HPL package on CPU + GPGPU heterogeneous clusters, which is named for GHPL, and give the implementation mechanisms in detail. The experimental results are measured on the platforms of multi-core CPUs and CPU + GPGPU heterogeneous clusters. From the experimental results, we can find out that our GHPL program has good scalability on all the experimental environments and can sustain more than 1.7Teraflops both on the cluster with 16 nodes containing 32 NVIDIA Tesla C1060 GPUs and on the cluster with 8 nodes containing 32 ATI GeForce GTX 295 GPUs, while the average speedup of it with respect to HPL is 3.06 and 2.40 respectively.
  • Keywords
    coprocessors; graphical user interfaces; linear systems; CPU+GPGPU heterogeneous cluster; GHPL program; Linpack performance acceleration; NVIDIA Tesla C1060 GPU; linear equations; mixed precision algorithm; multicore CPU; Accuracy; Benchmark testing; Clustering algorithms; Convergence; Graphics processing unit; Linear systems; Niobium; CUDA; HPL; mixed precision algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology (CIT), 2010 IEEE 10th International Conference on
  • Conference_Location
    Bradford
  • Print_ISBN
    978-1-4244-7547-6
  • Type

    conf

  • DOI
    10.1109/CIT.2010.212
  • Filename
    5577898