Title :
Multithreading to improve cycle width and CPI in superpipelined superscalar processors
Author :
Goossens, Bernard ; Vu, Duc Thang
Author_Institution :
IBP-LITP, Paris VII Univ., France
Abstract :
This paper presents a multithreaded superpipelined superscalar processor design. It is expected to have a sustained rate of 5.4 instructions run per cycle, with 4 threads on chip. Multithreading serves to improve the superscalar CPI by interleaving threads executions. Operator sharing is used instead of out of order execution. It requires less hardware-no reservation stations, collision vectors or renamed registers-and should offer a greater parallelism potential. Arithmetic operators, including adders, shifters, a multiplier and a step divider, have been pipelined to reduce the processor cycle width to a 16 bits adder propagation delay. Separate and equal lengths data paths controlled by a completely RISC instruction set allow efficient in order issue and termination. Floating point operations are emulated with integer ones with data dependent algorithms providing as good latencies as for traditional hardware implementation. A single register file serves for both the integer and the floating point data
Keywords :
adders; floating point arithmetic; pipeline processing; reduced instruction set computing; 16 bits adder propagation delay; CPI; RISC instruction set; adders; arithmetic operators; cycle width; data dependent algorithms; floating point operations; latencies; multiplier; multithreading; operator sharing; register file; shifters; step divider; superpipelined superscalar processors; Arithmetic; Hardware; Multithreading; Operating systems; Out of order; Parallel processing; Pipeline processing; Propagation delay; Registers; Yarn;
Conference_Titel :
Parallel Architectures, Algorithms, and Networks, 1996. Proceedings., Second International Symposium on
Conference_Location :
Beijing
Print_ISBN :
0-8186-7460-1
DOI :
10.1109/ISPAN.1996.508958