Regional Information Center for Science and Technology - Exploiting Outer Loop Parallelism of Nested Loop on Coarse-Grained Reconfigurable Architectures

DocumentCode :

3652787

Title :

Exploiting Outer Loop Parallelism of Nested Loop on Coarse-Grained Reconfigurable Architectures

Author :

Dajiang Liu;Shouyi Yin;Leibo Liu;Shaojun Wei

Author_Institution :

Inst. of Microelectron., Tsinghua Univ., Beijing, China

fYear :

2014

fDate :

5/1/2014

Firstpage :

Lastpage :

Abstract :

A coarse-grained reconfigurable architecture is a promising architecture with high power efficiency, typically composed of a host controller and a processing element array (PEA). Loops are often mapped onto PEAs for acceleration. In previous work, the innermost loop is pipelined, and the maximal number of concurrently executable operators (CEOs) in the kernel is limited by the inner loop. Consider the loop body data flow graph (DFG) of an input 2D nested loop with an inner loop-carried dependence ([0,1]) and an outer loop-carried dependence ([1,1]). We map this loop onto a 4×4 PEA with pipelining. We assume that the latency of executing one loop iteration is L_b, and that the number of iterations involved in one cycle of the kernel phase of pipelining is W_k. Because there is an inner loop dependence ([0,1]), the initiation interval of inner loop pipelining (II_i) can be minimized to 1, and we get W_k = 4. We also note that the angle α is formed by two sides in Figure 1(b), which can be written as follows: tan(α) = W_k/L_b = 1/II_i.
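
The relation tan(α) = W_k/L_b = 1/II_i can be checked numerically with a minimal sketch. The function names below are hypothetical, and the value L_b = 4 is an assumption inferred from the stated W_k = 4 and II_i = 1; the abstract does not give L_b directly.

```python
from fractions import Fraction


def iterations_in_kernel(latency_lb: int, ii_inner: int) -> Fraction:
    """Return W_k = L_b / II_i, rearranged from W_k / L_b = 1 / II_i."""
    if latency_lb <= 0 or ii_inner <= 0:
        raise ValueError("latency and initiation interval must be positive")
    return Fraction(latency_lb, ii_inner)


def tan_alpha(ii_inner: int) -> Fraction:
    """Return tan(alpha) = 1 / II_i for inner-loop pipelining."""
    if ii_inner <= 0:
        raise ValueError("initiation interval must be positive")
    return Fraction(1, ii_inner)


# Assumed example values: L_b = 4 (inferred), II_i = 1 (from the abstract).
w_k = iterations_in_kernel(latency_lb=4, ii_inner=1)
slope = tan_alpha(ii_inner=1)
print(w_k, slope)
```

Under these assumed values the sketch reproduces W_k = 4, and a larger II_i gives a proportionally smaller W_k for the same L_b.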

Keywords :

"Kernel","Pipeline processing","Arrays","Microelectronics","Educational institutions"

Publisher :

IEEE

Conference_Title :

Field-Programmable Custom Computing Machines (FCCM), 2014 IEEE 22nd Annual International Symposium on

Type :

conf

DOI :

10.1109/FCCM.2014.19

Filename :

6861581

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3652787