مرکز منطقه ای اطلاع رساني علوم و فناوري - A hierarchical dependence check and folded rename mapping based scalable dispatch stage

DocumentCode :

1602910

Title :

A hierarchical dependence check and folded rename mapping based scalable dispatch stage

Author :

Sankaranarayanan, Vadhiraj ; Tyagi, Akhilesh

Author_Institution :

Dept. of ECE, Iowa State Univ., Ames, IA, USA

fYear :

2001

fDate :

6/23/1905 12:00:00 AM

Firstpage :

249

Lastpage :

254

Abstract :

In a superscalar pipeline, the dispatch stage performs register renaming, which involves map table lookup logic and dependence check logic. Both subtasks do not scale well with the dispatch width of the processor. The number of comparators necessary for the dependence check logic grows quadratically with the dispatch width of the processor. The rename map table´s word line capacitance scales linearly with the dispatch width. This paper proposes and evaluates schemes to alleviate both these problems. By performing the dependence check hierarchically in two stages, the number of comparators required in the dependence check logic is reduced from quadratic to linear in the dispatch width. This scheme is also scalable with the dispatch width by allowing a dispatch of DW² instructions in the same processor cycle time that the current microprocessors use to dispatch DW instructions. Simple scalar based simulations indicate a performance penalty of less than 10% over Spec95 CPU benchmarks due to the extra cycle introduced. The second scheme started with an objective of utilizing speculation in rename and dependence information. The only beneficial subspace of this speculation appears to be the reuse of rename information of those instructions whose source operands are produced either in their own basic block or in the immediately preceding basic block. By storing rename information of such instructions in a rename cache, these instructions can be dispatched directly to the reservation stations if the program takes the same path again. The performance improvement due to the rename cache is approximately 7% for SPEC95 integer benchmarks

Keywords :

cache storage; parallel architectures; performance evaluation; pipeline processing; dependence check; instruction-level-parallelism; integer benchmarks; microarchitecture; non-scalable bottlenecks; performance improvement; performance penalty; register rename logic; register renaming; rename cache; rename information; scalable dispatch stage; superscalar pipeline; Capacitance; Delay lines; Logic design; Microarchitecture; Microprocessors; Pipelines; Registers; Scalability; Table lookup;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Design, 2001. ICCD 2001. Proceedings. 2001 International Conference on

Conference_Location :

Austin, TX

ISSN :

1063-6404

Print_ISBN :

0-7695-1200-3

Type :

conf

DOI :

10.1109/ICCD.2001.955036

Filename :

955036

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1602910