Title :
FPGA-based implementation of Horner´s rule on a high performance heterogeneous computer
Author :
Malone, Ales-cia N. ; Morris, Gerald R. ; Abed, Khalid H.
Author_Institution :
Comput. Eng., Jackson State Univ., Jackson, MS, USA
Abstract :
Codes that have large/irregular-stride (L/I) memory access patterns often perform poorly on mainstream clusters because of the general purpose processor (GPP) memory hierarchy. In the event of erratic access of data, the cache suffers misses and causes inadequate performance of the kernel. High performance heterogeneous computers (HPHCs) are parallel computing clusters that contain multiple and different processing units such as GPPs, field programmable gate arrays (FPGAs), graphics processing units, etc., connected via a high-speed network. In this research, 64-bit foating-point MPI codes are used to illustrate the run-time performance impact of L/I memory accesses in both software-only and FPGA-augmented codes and to assess the benefits of mapping L/I-type codes onto HPHC clusters. The experiments documented herein reveal that large-stride software-only codes experience unfavorable performance. On the contrary, large-stride FPGA-augmented codes do not endure such disparaging performance because FPGAs are not impacted by stride. For experiments with large data sizes, the unit-stride FPGA-augmented code ran over two times slower than software except for when there was substantial data reuse. On the other hand, the large-stride FPGA-augmented code ran faster than software for all the larger data sizes.
Keywords :
application program interfaces; computer graphic equipment; coprocessors; field programmable gate arrays; message passing; HPHC clusters; Horner rule; data reuse; field programmable gate arrays; floating-point MPI codes; general purpose processor memory hierarchy; graphics processing units; high performance heterogeneous computer; high-speed network; large-irregular-stride memory access patterns; multiple processing units; parallel computing clusters; unit-stride FPGA-augmented code; Algorithm design and analysis; Arrays; Clocks; Computers; Field programmable gate arrays; Runtime; Software;
Conference_Titel :
Southeastcon, 2011 Proceedings of IEEE
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-61284-739-9
DOI :
10.1109/SECON.2011.5752949