Title :
Increased scalability and power efficiency by using multiple speed pipelines
Author :
Talpes, Emil ; Marculescu, Diana
Author_Institution :
Dept. of Comput. & Electr. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
One of the most important problems faced by microarchitecture designers is the poor scalability of some of the current solutions with increased clock frequencies and wider pipelines. As several studies show, internal processor structures scale differently with decreasing device sizes. While in some cases the access latency is determined by the speed of the logic circuitry, for others it is dominated by the interconnect delay. Furthermore, while some stages can be super-pipelined with relatively small performance loss, others must be kept atomic. This paper proposes a possible solution to this problem, avoiding the traditional trade-off between parallelism and clock speed. First, allowing instructions to enter and leave the Issue Window in an asynchronously manner enables faster speeds in the front-end at the expense of small synchronization latencies. Second, using an Execution Cache for storing instructions that are already scheduled allows for bypassing the issue circuitry and thus clocking the execution core at higher frequencies. Combined, these two mechanisms result in a 50% to 60% performance increase for our test microarchitecture, without requiring a completely new scheduling mechanism. Furthermore, the proposed microarchitecture requires significantly less energy, with 30% reduction in a 0.1 Sum or 20% in a 0.06um process technology over the original baseline.
Keywords :
instruction sets; logic circuits; multiprocessor interconnection networks; performance evaluation; pipeline processing; processor scheduling; program processors; program testing; synchronisation; access latency; execution cache; interconnect delay; internal processor structures; logic circuitry; microarchitecture designers; multiple speed pipelines; power efficiency; scalability; synchronization latency; Clocks; Delay; Frequency; Integrated circuit interconnections; Logic circuits; Logic devices; Microarchitecture; Performance loss; Pipelines; Scalability;
Conference_Titel :
Computer Architecture, 2005. ISCA '05. Proceedings. 32nd International Symposium on
Print_ISBN :
0-7695-2270-X
DOI :
10.1109/ISCA.2005.33