• DocumentCode
    3130177
  • Title

    Extended split-issue: enabling flexibility in the hardware implementation of NUAL VLIW DSPs

  • Author

    Iyer, Bharath ; Srinivasan, Sadagopan ; Jacob, Bruce

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Maryland Univ., College Park, MD, USA
  • fYear
    2004
  • fDate
    19-23 June 2004
  • Firstpage
    364
  • Lastpage
    375
  • Abstract
    VLIW architecture based DSPs have become widespread due to the combined benefits of simple hardware and compiler-extracted instruction-level parallelism. However, the VLIW instruction set architecture and its hardware implementation are tightly coupled, especially so for Non-Unit Assumed Latency (NUAL) VLIWs. The problem of object code compatibility across processors having different numbers of functional units or hardware latencies has been the Achilles´ heel of this otherwise powerful architecture. In this paper, we propose eXtended Split-Issue (XSI), a novel mechanism that breaks the instruction packet syntax of an NUAL VLIW compiler without violating the dataflow dependences. XSI provides a designer the freedom of disassociating the hardware implementation of the NUAL VLIW processor from the instruction set architecture. Further, we investigate fairly radical (in the context of VLIW) changes to the hardware-like removing an adder, adding a multiplier, and incorporating simultaneous multithreading (SMT) - to show that our technique works for a variety of hardware configurations without compromising on performance. The technique can be used in both single-threaded and multi-threaded architectures to achieve a level of flexibility heretofore unavailable in the VLIW arena.
  • Keywords
    digital signal processing chips; instruction sets; multi-threading; multiprocessing systems; parallel architectures; parallel machines; program compilers; NUAL VLIW DSP; NUAL VLIW compiler; NUAL VLIW processor; VLIW architecture; VLIW instruction set architecture; compiler-extracted instruction-level parallelism; dataflow dependences; extended split-issue; hardware configurations; hardware implementation; hardware latencies; hardware parallelism; instruction packet syntax; multithreaded architecture; nonunit assumed latency; object code compatibility; simultaneous multithreading; single-threaded architecture; Computer architecture; Costs; Delay; Digital signal processing; Hardware; Instruction sets; Jacobian matrices; Program processors; Programming profession; VLIW;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture, 2004. Proceedings. 31st Annual International Symposium on
  • ISSN
    1063-6897
  • Print_ISBN
    0-7695-2143-6
  • Type

    conf

  • DOI
    10.1109/ISCA.2004.1310788
  • Filename
    1310788