Title :
Clustered multithreaded architectures - pursuing both IPC and cycle time
Author :
Collins, Jamison D. ; Tullsen, Dean M.
Author_Institution :
Dept. of Comput. Sci. & Eng., California Univ., San Diego, La Jolla, CA, USA
Abstract :
Summary form only given. Clustering is an architectural technique that allows the design of wide superscalar processors without sacrificing cycle time, but at the cost of longer communication latencies. Simultaneous multithreading architectures effectively tolerate instruction latency, but put even more pressure on timing-critical processor resources. We show that the synergistic combination of the two techniques minimizes the IPC impact of the clustered architecture, and even permits more aggressive clustering of the processor than is possible with a single-threaded processor. Additionally, we show that multithreading enables effective instruction steering policies unavailable to a single-threaded clustered architecture. We explore the impact of aggressively clustering four complex processor structures, (1) instruction window wakeup and functional unit bypass logic, (2) register renaming logic, (3) the fetch unit, and (4) the integer register file, on a simultaneous multithreading processor.
Keywords :
instruction sets; multi-threading; multiprocessing systems; IPC; clustered multithreaded architecture; cycle tune; fetch unit; functional unit bypass logic; instruction window wakeup; integer register file; register renaming logic; simultaneous multithreading processor; single-threaded clustered architecture; timing-critical processor resources; tolerate instruction latency; wide superscalar processor; Clocks; Costs; Delay; Hardware; Logic; Multithreading; Pipelines; Registers; Surface-mount technology; Throughput;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
DOI :
10.1109/IPDPS.2004.1303010