• DocumentCode
    1621224
  • Title

    An instruction set and microarchitecture for instruction level distributed processing

  • Author

    Kim, Ho-Seop ; Smith, James E.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Wisconsin Univ., Madison, WI, USA
  • fYear
    2002
  • fDate
    6/24/1905 12:00:00 AM
  • Firstpage
    71
  • Lastpage
    81
  • Abstract
    An instruction set architecture (ISA) suitable for future microprocessor design constraints is proposed. The ISA has hierarchical register files with a small number of accumulators at the top. The instruction stream is divided into chains of dependent instructions (strands) where intra-strand dependences are passed through the accumulator. The general-purpose register file is used for communication between strands and for holding global values that have many consumers. A microarchitecture to support the proposed ISA is proposed and evaluated. The microarchitecture consists of multiple, distributed processing elements. Each PE contains an instruction issue FIFO, a local register (accumulator) and local copy! of register file. The overall simplicity, hierarchical value communication, and distributed implementation will provide a very high clock speed and a relatively short pipeline while maintaining a form of superscalar out-of-order execution. Detailed timing simulations using translated program traces show the proposed microarchitecture is tolerant of global wire latencies. Ignoring the significant clock frequency advantages, a microarchitecture that supports a 4-wide fetch/decode pipeline, 8 serial PEs, and a two-cycle inter-PE communication latency performs as well as a conventional 4-way out-of-order superscalar processor
  • Keywords
    computational complexity; computer architecture; instruction sets; timing; accumulator; accumulators; communication latency; distributed processing elements; general-purpose register file; hierarchical register files; instruction level distributed processing; instruction set; local register; microarchitecture; microprocessor design constraints; register file; timing simulations; translated program traces; Clocks; Delay; Distributed processing; Instruction sets; Microarchitecture; Microprocessors; Out of order; Pipelines; Timing; Wire;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Architecture, 2002. Proceedings. 29th Annual International Symposium on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1063-6897
  • Print_ISBN
    0-7695-1605-X
  • Type

    conf

  • DOI
    10.1109/ISCA.2002.1003563
  • Filename
    1003563