Title :
A distributed processor state management architecture for large-window processors
Author :
González, Isidro ; Galluzzi, Marco ; Veidenbaum, Alex ; Ramírez, Yong A. ; Cristal, Adrián ; Valero, Mateo
Author_Institution :
Dept. of Comput. Archit., UPC, Barcelona
Abstract :
Processor architectures with large instruction windows have been proposed to expose more instruction-level parallelism (ILP) and increase performance. Some of the proposed architectures replace a re-order buffer (ROB) with a check-pointing mechanism and an out-of-order release of processor resources. Check-pointing, however, leads to an imprecise processor state recovery on mis-predicted branches and exceptions and re-execution of correct-path instructions after state recovery. It also requires large register files complicating renaming, allocation and release of physical registers. This paper proposes a new processor architecture called a Multi-State Processor (MSP). The MSP does not use check-pointing, avoids the above-mentioned problems, and has a fast, distributed state recovery mechanism. The MSP uses a novel register management architecture allowing implementation of large register files with simpler and more scalable register allocation, renaming, and release. It is also key to precise processor state recovery mechanism. The MSP is shown to improve IPC by 14%, on average, for integer SPEC CPU2000 benchmarks compared to a check-pointing based mechanism ([2]) when a fast and simple branch predictor is used. With a very aggressive branch predictor the IPC improvement is 1%, on average, and 3% if some of the programs are optimized for the MSP. The MSP also reduces the average number of executed instructions by 16.5% (12% for the aggressive branch predictor), mostly due to precise state recovery. This improves the MSP processor energy efficiency even though it uses a larger register file.
Keywords :
checkpointing; file organisation; parallel architectures; checkpointing mechanism; correct path instructions; distributed processor state management architecture; fast distributed state recovery; instruction-level parallelism; large instruction windows; large register files; large-window processors; mispredicted branches; multistate processor; processor architecture; processor resources; processor state recovery; register management architecture; reorder buffer; scalable register allocation; scalable register release; scalable register renaming; Checkpointing; Cities and towns; Computer aided instruction; Computer architecture; Computer science; Concurrent computing; Distributed computing; Parallel processing; Registers; Resumes; Check-pointing; component; large-window; misprediction recovery; register file; register renaming;
Conference_Titel :
Microarchitecture, 2008. MICRO-41. 2008 41st IEEE/ACM International Symposium on
Conference_Location :
Lake Como
Print_ISBN :
978-1-4244-2836-6
Electronic_ISBN :
1072-4451
DOI :
10.1109/MICRO.2008.4771775