DocumentCode :
3209011
Title :
An architecture framework for transparent instruction set customization in embedded processors
Author :
Clark, Nathan ; Blome, Jason ; Chu, Michael ; Mahlke, Scott ; Biles, Stuart ; Flautner, Krisztián
Author_Institution :
Adv. Comput. Archit. Lab., Michigan Univ., Ann Harbor, MI, USA
fYear :
2005
fDate :
4-8 June 2005
Firstpage :
272
Lastpage :
283
Abstract :
Instruction set customization is an effective way to improve processor performance. Critical portions of application data-flow graphs are collapsed for accelerated execution on specialized hardware. Collapsing dataflow subgraphs will compress the latency along critical paths and reduces the number of intermediate results stored in the register file. While custom instructions can be effective, the time and cost of designing a new processor for each application is immense. To overcome this roadblock, this paper proposes a flexible architectural framework to transparently integrate custom instructions into a general-purpose processor. Hardware accelerators are added to the processor to execute the collapsed subgraphs. A simple microarchitectural interface is provided to support a plug-and-play model for integrating a wide range of accelerators into a pre-designed and verified processor core. The accelerators are exploited using an approach of static identification and dynamic realization. The compiler is responsible for identifying profitable subgraphs, while the hardware handles discovery, mapping, and execution of compatible subgraphs. This paper presents the design of a plug-and-play transparent accelerator system arid evaluates the cost/performance implications of the design.
Keywords :
data flow graphs; embedded systems; instruction sets; performance evaluation; program compilers; reduced instruction set computing; architecture framework; collapsed subgraphs; collapsing dataflow subgraphs; data-flow graphs; dynamic realization; embedded processors; flexible architectural framework; hardware accelerators; microarchitectural interface; plug-and-play transparent accelerator system; processor core; static identification; transparent instruction set customization; Acceleration; Application software; Application specific processors; Computer architecture; Costs; Delay; Hardware; Laboratories; Process design; Registers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 2005. ISCA '05. Proceedings. 32nd International Symposium on
ISSN :
1063-6897
Print_ISBN :
0-7695-2270-X
Type :
conf
DOI :
10.1109/ISCA.2005.9
Filename :
1431563
Link To Document :
بازگشت