مرکز منطقه ای اطلاع رساني علوم و فناوري - Balancing ILP and TLP in SMT Architectures through Out-of-Order Instruction Dispatch

DocumentCode :

2503558

Title :

Balancing ILP and TLP in SMT Architectures through Out-of-Order Instruction Dispatch

Author :

Sharkey, Joseph ; Ponomarev, Dmitry

Author_Institution :

Dept. of Comput. Sci. State, New York Univ., Binghamton, NY

fYear :

2006

fDate :

14-18 Aug. 2006

Firstpage :

329

Lastpage :

336

Abstract :

Simultaneous multi-threading (SMT) architectures open up new avenues for datapath optimizations due to the presence of thread-level parallelism (TLP). One recent proposal for exploiting such parallelism is the 2OP_BLOCK scheduler design, which completely avoids the dispatch of instructions with two non-ready source operands into the issue queue. This technique reduces the scheduler complexity and also provides performance benefits for workloads with sufficient TLP, as the issue queue is more efficiently utilized. In this paper we first revisit the 2OP_BLOCK scheduler and show that this design actually results in performance losses for workloads with a limited number of threads because the constraints imposed on the exploitable ILP within each thread outweigh its advantages. To balance the ILP (instruction-level parallelism) and TLP in SMT processors supporting such schedulers, we propose out-of-order dispatch of instructions within each thread. This simple augmentation naturally allows the 2OP_BLOCK scheduler to perform well even when the number of threads is small. Furthermore, for environments with a larger number of threads, the out-of-order dispatch mechanism improves the performance of the original proposal by up to 15% on the average across simulated multithreaded mixes of SPEC 2000 benchmarks

Keywords :

computational complexity; multi-threading; multiprocessing systems; parallel architectures; scheduling; 2OP_BLOCK scheduling; SPEC 2000 benchmarks; datapath optimization; instruction-level parallelism; out-of-order instruction dispatch; scheduling complexity; simultaneous multithreading architectures; thread-level parallelism; Computer architecture; Dynamic scheduling; Logic; Multithreading; Out of order; Parallel processing; Processor scheduling; Proposals; Surface-mount technology; Yarn;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Parallel Processing, 2006. ICPP 2006. International Conference on

Conference_Location :

Columbus, OH

ISSN :

0190-3918

Print_ISBN :

0-7695-2636-5

Type :

conf

DOI :

10.1109/ICPP.2006.28

Filename :

1690635

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2503558