• DocumentCode
    3074798
  • Title

    A comparative benchmarking of the FFT on Fermi and Evergreen GPUs

  • Author

    Ahmed, Mohamed F. ; Haridy, Omar

  • Author_Institution
    Comput. Sci. & Eng., American Univ. in Cairo, Cairo, Egypt
  • fYear
    2011
  • fDate
    10-12 April 2011
  • Firstpage
    127
  • Lastpage
    128
  • Abstract
    NVIDIA and AMD GPUs are gaining traction in HPC for their performance and architectural aspects. It is very important to measure and analyze the relative power of each architecture. In this paper, we analyze the architecture of NVIDIA's Fermi and AMD's Evergreen processors and demonstrate the best practices and techniques to best utilize the capabilities of each architecture. We implemented the FFT on both cards utilizing our findings to reach new performance ceilings on both GPUs.
  • Keywords
    benchmark testing; computer graphic equipment; coprocessors; fast Fourier transforms; performance evaluation; storage management; AMD Evergreen GPU; FFT; NVIDIA Fermi GPU; architecture analysis; comparative benchmarking; performance ceiling; Bandwidth; Concurrent computing; Graphics processing unit; High definition video; Instruction sets; Pipelines;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on
  • Conference_Location
    Austin, TX
  • Print_ISBN
    978-1-61284-367-4
  • Electronic_ISBN
    978-1-61284-368-1
  • Type

    conf

  • DOI
    10.1109/ISPASS.2011.5762726
  • Filename
    5762726