• DocumentCode
    170416
  • Title

    Accelerating embarrassingly parallel algorithm on Intel MIC

  • Author

    Qinglin Wang ; Jie Liu ; Xiantuo Tang ; Feng Wang ; Guitao Fu ; Zuocheng Xing

  • Author_Institution
    Sci. & Technol. on Parallel & Distrib. Process. Lab., Nat. Univ. of Defense Technol., Changsha, China
  • fYear
    2014
  • fDate
    16-18 May 2014
  • Firstpage
    213
  • Lastpage
    218
  • Abstract
    The Embarrassingly Parallel (EP) algorithm which is typical of many Monte Carlo applications provides an estimate of the upper achievable limits for double precision performance of parallel supercomputers. Recently, Intel released Many Integrated Core (MIC) architecture as a many-core co-processor. MIC often offers more than 50 cores each of which can run four hardware threads as well as 512-bit vector instructions. In this paper, we describe how the EP algorithm is accelerated effectively on the platforms containing MIC using the offload execution model. The result shows that the efficient implementation of EP algorithm on MIC can take full advantage of MIC´s computational resources and achieves a speedup of 3.06 compared with that on Intel Xeon E5-2670 CPU. Based on the EP algorithm on MIC and an effective task distribution model, the implementation of EP algorithm on a CPU-MIC heterogeneous platform achieves the performance of up to 2134.86 Mop/s and 4.04 times speedup compared with that on Intel Xeon E5-2670 CPU.
  • Keywords
    Monte Carlo methods; multiprocessing systems; parallel algorithms; parallel architectures; parallel machines; 512-bit vector instructions; CPU-MIC heterogeneous platform; EP algorithm; Intel MIC; Intel released many integrated core architecture; MIC computational resources; Monte Carlo applications; embarrassingly parallel algorithm; hardware threads; offload execution model; task distribution model; Algorithm design and analysis; Clustering algorithms; Computer architecture; Graphics processing units; Load modeling; Microwave integrated circuits; Vectors; NPB embarrassingly parallel algorithm; heterogeneous platform; many integrated core architecture;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Progress in Informatics and Computing (PIC), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-2033-4
  • Type

    conf

  • DOI
    10.1109/PIC.2014.6972327
  • Filename
    6972327