• DocumentCode
    1711436
  • Title

    Address generation scheme for a coarse grain reconfigurable architecture

  • Author

    Shami, Muhammad Ali ; Hemani, Ahmed

  • Author_Institution
    Sch. of ICT, R. Inst. of Technol., Stockholm, Sweden
  • fYear
    2011
  • Firstpage
    17
  • Lastpage
    24
  • Abstract
    In this paper, we describe a versatile address generation scheme for distributed storage resources of a coarse grain Parallel Distributed Digital Signal Processing (PDDSP) reconfigurable architecture under development in our group. This scheme proposes the distributed address generation units (AGUs) to decouple the address generation logic with compute logic to exploit parallelism (ILP and TLP). To achieve this, the proposed distributed address generation scheme with standard DSP address generation modes like linear vectorized, circular buffer and bit-reverse addressing, all with parameterizable address range and increment/decrement offsets is further enhanced with temporal flexibility by introducing three dynamically programmable delays: initial delay before the stream starts, middle delay after every address generation for the stream and end delay after the stream is complete. The dynamic programmability of these delays makes streams elastic that can be chained with an interrupt mechanism to create chained-elastic streams. Our approach is compared with the traditional approach of using VLIW and Scalar. Our approach shows 21× (Scalar), 10×(VLIW) reduction in instructions and 2×(Scalar) reduction in cycles for a single thread FIR filter. When compared for Synchronous and Asynchronous scenarios of two parallel treads T1 and T2, our approach shows 4.6×(Scalar), 5.6×(VLIW) reduction in instructions, 1.76× reduction in cycles for Synchronous and 4.6×(Scalar), 15×(VLIW) reduction in instructions, 1.76×(Scalar) reduction in cycles for Asynchronous threads.
  • Keywords
    digital signal processing chips; distributed processing; reconfigurable architectures; VLIW; address generation scheme; coarse grain reconfigurable architecture; distributed address generation units; distributed storage resources; interrupt mechanism; parallel distributed digital signal processing; temporal flexibility; Delay; Educational institutions; Fabrics; Program processors; Registers; VLIW;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Application-Specific Systems, Architectures and Processors (ASAP), 2011 IEEE International Conference on
  • Conference_Location
    Santa Monica, CA
  • ISSN
    2160-0511
  • Print_ISBN
    978-1-4577-1291-3
  • Electronic_ISBN
    2160-0511
  • Type

    conf

  • DOI
    10.1109/ASAP.2011.6043232
  • Filename
    6043232