Title : 
Improving Performance of Codes with Large/Irregular Stride Memory Access Patterns via High Performance Reconfigurable Computers
         
        
            Author : 
Abed, Khalid H. ; Morris, Gerald R.
         
        
            Author_Institution : 
Sch. of Eng., Dept. of Comput. Eng., Jackson State Univ., Jackson, MS, USA
         
        
        
        
        
        
            Abstract : 
Parallel codes with large-stride/irregular-stride (L/I) memory access patterns, e.g., sparse matrix and linked list codes, often perform poorly on mainstream clusters because of the general purpose processor (GPP) memory hierarchy. High performance reconfigurable computers (HPRCs) are parallel computing clusters containing multiple GPPs and field programmable gate arrays (FPGAs) connected via a high-speed network. In this research, simple 64-bit floating-point parallel codes are used to illustrate the performance impact of L/I memory accesses in software (SW) and FPGA-augmented (FA) codes and to assess the benefits of mapping L/I-type codes onto HPRCs. The experiments reveal that large-stride SW codes, particularly those involving data reuse, experience severe performance degradation compared with unit-stride SW codes. In contrast, large-stride FA codes experience minimal degradation compared with unit-stride FA codes. More importantly, for codes that involve data reuse, the experiments demonstrate performance improvements of up to nearly tenfold for large-stride FA codes compared with large-stride SW codes.
         
        
            Keywords : 
codes; field programmable gate arrays; parallel processing; storage management; FPGA-augmented codes; field programmable gate arrays; floating-point parallel codes; general purpose processor memory hierarchy; high performance reconfigurable computers; irregular-stride memory access patterns; large-stride stride memory access; linked list code; sparse matrix code; unit-stride software codes; Computers; Equations; Field programmable gate arrays; Kernel; Parallel processing; Sparse matrices;
         
        
        
        
            Conference_Titel : 
DoD High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC), 2009
         
        
            Conference_Location : 
San Diego, CA
         
        
            Print_ISBN : 
978-1-4244-5768-7
         
        
        
            DOI : 
10.1109/HPCMP-UGC.2009.70