DocumentCode
341797
Title
A decoupled scheduled dataflow multithreaded architecture
Author
Kavi, Krishna M. ; Kim, Hyong-Shik ; Arul, Joseph ; Hurson, Ali R.
Author_Institution
Dept. of Electr. & Comput. Eng., Alabama Univ., Huntsville, AL, USA
fYear
1999
fDate
1999
Firstpage
138
Lastpage
143
Abstract
Proposes a new approach to building multithreaded uniprocessors that become building blocks in high-end computing architectures. Our innovativeness stems from a multithreaded architecture with non-blocking threads where all memory accesses are decoupled from the thread execution. Data is pre-loaded into the thread context (registers), and all results are post-stored after the completion of the thread execution. The decoupling of memory accesses from thread execution requires a separate unit to perform the necessary pre-loads and post-stores, and to control the allocation of hardware thread contexts to enabled threads. This separation facilitates achieving high locality and minimizing the impact of distribution and hierarchy in large memory systems. The non-blocking nature of threads eliminates the need for thread switching, thus improving the overhead in scheduling threads. The functional execution paradigm eliminates complex hardware required for scheduling instructions for modern superscalar architectures. We present preliminary results obtained from Monte Carlo simulations of the proposed architectural features
Keywords
Monte Carlo methods; data flow computing; multi-threading; parallel architectures; processor scheduling; Monte Carlo simulations; decoupled memory accesses; decoupled scheduled dataflow multithreaded architecture; distribution; functional execution paradigm; hardware thread context allocation; hierarchy; high-end computing architectures; large memory systems; locality; multithreaded uniprocessors; nonblocking threads; post-stored results; pre-loaded data; registers; superscalar architectures; thread scheduling overhead; Computer architecture; Computer science; Cost function; Delay; Hardware; Multithreading; Pipelines; Processor scheduling; Read only memory; Yarn;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Architectures, Algorithms, and Networks, 1999. (I-SPAN '99) Proceedings. Fourth InternationalSymposium on
Conference_Location
Perth/Fremantle, WA
ISSN
1087-4089
Print_ISBN
0-7695-0231-8
Type
conf
DOI
10.1109/ISPAN.1999.778930
Filename
778930
Link To Document