DocumentCode
3074798
Title
A comparative benchmarking of the FFT on Fermi and Evergreen GPUs
Author
Ahmed, Mohamed F. ; Haridy, Omar
Author_Institution
Comput. Sci. & Eng., American Univ. in Cairo, Cairo, Egypt
fYear
2011
fDate
10-12 April 2011
Firstpage
127
Lastpage
128
Abstract
NVIDIA and AMD GPUs are are gaining traction in HPC for their performance and architectural aspects. It is very important to measure and analyze the relative power of each architecture. In this paper, we analyze the architecture of NVIDIA´s Fermi and AMD´s Evergreen processors and demonstrate the best practices and techniques to best utilize the capabilities of each architecture. We implemented the FFT on both cards utilizing our findings to reach new performance ceilings on both GPUs.
Keywords
benchmark testing; computer graphic equipment; coprocessors; fast Fourier transforms; performance evaluation; storage management; AMD Evergreen GPU; FFT; NVIDIA Fermi GPU; architecture analysis; comparative benchmarking; performance ceiling; Bandwidth; Concurrent computing; Graphics processing unit; High definition video; Instruction sets; Pipelines;
fLanguage
English
Publisher
ieee
Conference_Titel
Performance Analysis of Systems and Software (ISPASS), 2011 IEEE International Symposium on
Conference_Location
Austin, TX
Print_ISBN
978-1-61284-367-4
Electronic_ISBN
978-1-61284-368-1
Type
conf
DOI
10.1109/ISPASS.2011.5762726
Filename
5762726
Link To Document