• DocumentCode
    3691856
  • Title

    Measuring Predictability of Nvidia´s GPU Schedulers: Application to the Summation Problem

  • Author

    David Defour

  • Author_Institution
    Lab. DALI-LIRMM, Perpignan, France
  • fYear
    2015
  • Firstpage
    17
  • Lastpage
    24
  • Abstract
    GPU´s are massively multicore architectures managing several thousands of concurrent threads. This concurrency, maintained through several schedulers, is necessary to keep high performance but negatively impact predictability. The lack of predictability is not a problem for most of data parallel applications written in CUDA and therefore hasn´t been widely studied. However for some others, such as the summation of floating-point numbers, this may be problematic as it can lead to deadlock situation. In this work, we first propose measures of predictability as well as CUDA tests to estimate this measure regarding warp and block scheduler for architectures from G80 to GK104. Then, we evaluate how to impact this measure and apply those results to the atomic addition of floating-point numbers and show how to make this operation predictable.
  • Keywords
    "Graphics processing units","Clocks","Computer architecture","Hardware","Atomic measurements"
  • Publisher
    ieee
  • Conference_Titel
    Embedded Multicore/Many-core Systems-on-Chip (MCSoC), 2015 IEEE 9th International Symposium on
  • Type

    conf

  • DOI
    10.1109/MCSoC.2015.9
  • Filename
    7328182