Title :
A Case Study of Hardware/Software Partitioning of Traffic Simulation on the Cray XD1
Author :
Tripp, Justin L. ; Gokhale, Maya B. ; Hansson, A..
Author_Institution :
Los Alamos Nat. Lab., Los Alamos
Abstract :
Scientific application kernels mapped to reconfigurable hardware have been reported to have 10times to 100times speedup over equivalent software. These promising results suggest that reconfigurable logic might offer significant speedup on applications in science and engineering. To accurately assess the benefit of hardware acceleration on scientific applications, however, it is necessary to consider the entire application including software components as well as the accelerated kernels. Aspects to be considered include alternative methods of hardware/software partitioning, communications costs, and opportunities for concurrent computation between software and hardware. Analysis of these factors is beyond the scope of current automatic parallelizing compilers. In this paper, a case study is presented in which a simulation of metropolitan road traffic networks is mapped onto a reconfigurable supercomputer, the Cray XD1. Five different methods are presented for mapping the application onto the combined hardware/software system. An approach for approximating the performance of each method is derived through analytic equations. Our results, both analytically and empirically, show that key predictors of performance (which are often not considered in reported speedup of kernel operations) are not necessarily maximum parallelism, but must account for the fraction of the problem that runs on the reconfigurable logic and the amount data flow between software and hardware.
Keywords :
data flow analysis; hardware-software codesign; parallel machines; reconfigurable architectures; traffic engineering computing; Cray XD1 supercomputer; application speedup; data flow; hardware acceleration; hardware-software partitioning; hardware-software system; kernel operations; metropolitan road traffic network; reconfigurable hardware; reconfigurable logic; scientific application kernels; software components; traffic simulation; Acceleration; Application software; Computational modeling; Concurrent computing; Costs; Hardware; Kernel; Performance analysis; Reconfigurable logic; Traffic control; Hardware/software codesign; simulation; system integration;
Journal_Title :
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
DOI :
10.1109/TVLSI.2007.912126