• DocumentCode
    228635
  • Title

    Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors

  • Author

    Heybrock, Simon ; Joo, Balint ; Kalamkar, Dhiraj D. ; Smelyanskiy, Mikhail ; Vaidyanathan, Karthikeyan ; Wettig, Tilo ; Dubey, Pradeep

  • Author_Institution
    Inst. for Theor. Phys., Univ. of Regensburg, Regensburg, Germany
  • fYear
    2014
  • fDate
    16-21 Nov. 2014
  • Firstpage
    69
  • Lastpage
    80
  • Abstract
    The gap between the cost of moving data and the cost of computing continues to grow, making it ever harder to design iterative solvers on extreme-scale architectures. This problem can be alleviated by alternative algorithms that reduce the amount of data movement. We investigate this in the context of Lattice Quantum Chromo dynamics and implement such an alternative solver algorithm, based on domain decomposition, on Intel® Xeon Phi co-processor (KNC) clusters. We demonstrate close-to-linear on-chip scaling to all 60 cores of the KNC. With a mix of single- and half-precision the domain-decomposition method sustains 400-500 Gflop/s per chip. Compared to an optimized KNC implementation of a standard solver [1], our full multi-node domain-decomposition solver strong-scales to more nodes and reduces the time-to-solution by a factor of 5.
  • Keywords
    coprocessors; data handling; iterative methods; lattice theory; multiprocessing systems; physics computing; quantum chromodynamics; Intel Xeon Phi coprocessors; KNC cluster; alternative solver algorithm; close-to-linear on-chip scaling; data movement; domain decomposition; extreme-scale architecture; iterative solvers; lattice QCD; lattice quantum chromodynamics; multinode domain-decomposition solver; Gold; Jacobian matrices; Lattices; Layout; Linear systems; Prefetching; Vectors; Domain decomposition; G.1.3 [Numerical Analysis]: Numerical Linear Algebra Sparse; Intel® Xeon Phi coprocessor; Lattice QCD Categories and subject descriptors: D.3.4 [Programming Languages]: Processors Optimization; and very la; structured;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for
  • Conference_Location
    New Orleans, LA
  • Print_ISBN
    978-1-4799-5499-5
  • Type

    conf

  • DOI
    10.1109/SC.2014.11
  • Filename
    7012993