A decoupled scheduled dataflow multithreaded architecture

Author

Kavi, Krishna M. ; Kim, Hyong-Shik ; Arul, Joseph ; Hurson, Ali R.

Author_Institution

Dept. of Electr. & Comput. Eng., Alabama Univ., Huntsville, AL, USA

fYear

1999

fDate

1999

Firstpage

138

Lastpage

143

Abstract

Proposes a new approach to building multithreaded uniprocessors that become building blocks in high-end computing architectures. Our innovativeness stems from a multithreaded architecture with non-blocking threads where all memory accesses are decoupled from the thread execution. Data is pre-loaded into the thread context (registers), and all results are post-stored after the completion of the thread execution. The decoupling of memory accesses from thread execution requires a separate unit to perform the necessary pre-loads and post-stores, and to control the allocation of hardware thread contexts to enabled threads. This separation facilitates achieving high locality and minimizing the impact of distribution and hierarchy in large memory systems. The non-blocking nature of threads eliminates the need for thread switching, thus improving the overhead in scheduling threads. The functional execution paradigm eliminates complex hardware required for scheduling instructions for modern superscalar architectures. We present preliminary results obtained from Monte Carlo simulations of the proposed architectural features

Keywords

Monte Carlo methods; data flow computing; multi-threading; parallel architectures; processor scheduling; Monte Carlo simulations; decoupled memory accesses; decoupled scheduled dataflow multithreaded architecture; distribution; functional execution paradigm; hardware thread context allocation; hierarchy; high-end computing architectures; large memory systems; locality; multithreaded uniprocessors; nonblocking threads; post-stored results; pre-loaded data; registers; superscalar architectures; thread scheduling overhead; Computer architecture; Computer science; Cost function; Delay; Hardware; Multithreading; Pipelines; Processor scheduling; Read only memory; Yarn;

fLanguage

English

Publisher

ieee

Conference_Titel

Parallel Architectures, Algorithms, and Networks, 1999. (I-SPAN '99) Proceedings. Fourth InternationalSymposium on

Conference_Location

Perth/Fremantle, WA

ISSN

1087-4089

Print_ISBN

0-7695-0231-8

Type

conf

DOI

10.1109/ISPAN.1999.778930

Filename

778930