DocumentCode :
2962899
Title :
Software-managed automatic data sharing for Coarse-Grained Reconfigurable coprocessors
Author :
Mai, Toan X. ; Lee, Jongeun
Author_Institution :
Sch. of ECE, UNIST, Ulsan, South Korea
fYear :
2012
fDate :
10-12 Dec. 2012
Firstpage :
277
Lastpage :
284
Abstract :
A Coarse-Grained Reconfigurable Architecture (CGRA) in a hybrid system can significantly accelerate the compute-intensive kernels of applications. However, the data communication overhead between the main processor (MP) and the CGRA may be large enough to negate the CGRA's speedup. In this paper we address the problem of reducing the data communication overhead in a hybrid system by offering a partially automatic data sharing technique that uses a special shared memory called Configurable Range Memory (CRM). Unlike previous work, the CRM architecture we use here is based on comparators, which gives much higher flexibility in where an array can be placed within a CRM but makes the runtime software management of a CRM much more challenging. We present an efficient runtime algorithm based on a first-fit heuristic. Our experimental results demonstrate that our CRM-based system can reduce the amount of data transfer between an MP and a CGRA by up to 89.5% compared to ScratchPad Memory (SPM)-based systems, while the software management overhead is only 1.20% to 1.34% on average (depending on CRM architecture parameters) of the kernel cycles in the MP-only execution. Overall, our CRM-based system achieves an average kernel speedup of 3.47 times over the MP-only execution, which is about a 20% improvement over the SPM-based system.
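The abstract names a first-fit heuristic but this record does not describe the authors' algorithm. The following is only a minimal sketch of generic first-fit placement of arrays in a range-addressed memory, not the paper's method. The class `RangeMemory`, the `max_ranges` limit (standing in for a fixed number of comparator-defined ranges), and all sizes are hypothetical names and values chosen for illustration.

```python
from typing import List, Optional, Tuple


class RangeMemory:
    """Hypothetical model of a shared memory with first-fit array placement."""

    def __init__(self, capacity: int, max_ranges: int) -> None:
        self.capacity = capacity      # total size of the memory, in bytes
        self.max_ranges = max_ranges  # assumed limit on simultaneously mapped ranges
        # Allocated ranges as (offset, size, name), kept sorted by offset.
        self.ranges: List[Tuple[int, int, str]] = []

    def allocate(self, name: str, size: int) -> Optional[int]:
        """Place an array in the first gap that fits; return its offset or None."""
        if size <= 0 or size > self.capacity or len(self.ranges) >= self.max_ranges:
            return None
        cursor = 0  # start of the gap currently being examined
        for index, (offset, length, _) in enumerate(self.ranges):
            if offset - cursor >= size:
                # The gap before this range is large enough: take it.
                self.ranges.insert(index, (cursor, size, name))
                return cursor
            cursor = offset + length
        if self.capacity - cursor >= size:
            # Fall back to the free space after the last range.
            self.ranges.append((cursor, size, name))
            return cursor
        return None

    def free(self, name: str) -> bool:
        """Release the range owned by `name`; return False if it is not mapped."""
        for index, (_, _, owner) in enumerate(self.ranges):
            if owner == name:
                del self.ranges[index]
                return True
        return False
```

In this sketch, freeing an array leaves a gap that a later, smaller array can reuse, which is the behavior that makes placement flexible but bookkeeping harder than with fixed slots.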
Keywords :
comparators (circuits); computational complexity; coprocessors; memory architecture; reconfigurable architectures; shared memory systems; software management; CGRA; CRM architecture parameters; MP-only execution; coarse-grained reconfigurable coprocessor architecture; comparators; compute-intensive application kernels; configurable range memory architecture; data communication overhead reduction; data transfer reduction; first-fit heuristic; hybrid system; kernel cycles; main processor; runtime algorithm; runtime software management; shared memory; software management overhead; software-managed partially-automatic data sharing; Arrays; Coprocessors; Customer relationship management; Data transfer; Kernel;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Field-Programmable Technology (FPT), 2012 International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-2846-3
Electronic_ISBN :
978-1-4673-2844-9
Type :
conf
DOI :
10.1109/FPT.2012.6412148
Filename :
6412148