• DocumentCode
    3074866
  • Title

    Where is the data? Why you cannot debate CPU vs. GPU performance without the answer

  • Author

    Gregg, Chris ; Hazelwood, Kim

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Virginia, Charlottesville, VA, USA
  • fYear
    2011
  • fDate
    10-12 April 2011
  • Firstpage
    134
  • Lastpage
    144
  • Abstract
    General purpose GPU Computing (GPGPU) has taken off in the past few years, with great promises for increased desktop processing power due to the large number of fast computing cores on high-end graphics cards. Many publications have demonstrated phenomenal performance and have reported speedups as much as 1000× over code running on multi-core CPUs. Other studies have claimed that well-tuned CPU code reduces the performance gap significantly. We demonstrate that this important discussion is missing a key aspect, specifically the question of where in the system data resides, and the overhead to move the data to where it will be used, and back again if necessary. We have benchmarked a broad set of GPU kernels on a number of platforms with different GPUs and our results show that when memory transfer times are included, it can easily take between 2 to 50× longer to run a kernel than the GPU processing time alone. Therefore, it is necessary to either include memory transfer overhead when reporting GPU performance, or to explain why this is not relevant for the application in question. We suggest a taxonomy for future CPU/GPU comparisons, and we argue that this is not only germane for reporting performance, but is important to heterogeneous scheduling research in general.
  • Keywords
    computer graphic equipment; coprocessors; general purpose computers; microprocessor chips; multiprocessing systems; GPU kernels; desktop processing power; fast computing cores; general purpose GPU computing; high-end graphics cards; memory transfer times; multicore CPUs; system data; Bandwidth; Benchmark testing; Databases; Graphics processing unit; Histograms; Kernel; Performance evaluation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-61284-367-4
  • Electronic_ISBN
    978-1-61284-368-1
  • Type

    conf

  • DOI
    10.1109/ISPASS.2011.5762730
  • Filename
    5762730