مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

1197866

Title :

Scalar operand networks

Author :

Taylor, Michael Bedford ; Lee, Walter ; Amarasinghe, Saman P. ; Agarwal, Anant

Author_Institution :

CSAIL, Cambridge, MA, USA

Volume :

Issue :

fYear :

2005

Firstpage :

145

Lastpage :

162

Abstract :

The bypass paths and multiported register files in microprocessors serve as an implicit interconnect to communicate operand values among pipeline stages and multiple ALUs. Previous superscalar designs implemented this interconnect using centralized structures that do not scale with increasing ILP demands. In search of scalability, recent microprocessor designs in industry and academia exhibit a trend toward distributed resources such as partitioned register files, banked caches, multiple independent compute pipelines, and even multiple program counters. Some of these partitioned microprocessor designs have begun to implement bypassing and operand transport using point-to-point interconnects. We call interconnects optimized for scalar data transport, whether centralized or distributed, scalar operand networks. Although these networks share many of the challenges of multiprocessor networks such as scalability and deadlock avoidance, they have many unique requirements, including ultra-low latency (a few cycles versus tens of cycles) and ultra-fast operation-operand matching. This work discusses the unique properties of scalar operand networks (SONs), examines alternative ways of implementing them, and introduces the AsTrO taxonomy to distinguish between them. It discusses the design of two alternative networks in the context of the Raw microprocessor, and presents timing, area, and energy statistics for a real implementation. The paper also presents a 5-tuple performance model for SONs and analyzes their performance sensitivity to network properties for ILP workloads.

Keywords :

computer architecture; instruction sets; logic design; microprocessor chips; multiprocessor interconnection networks; parallel processing; statistics; 5-tuple performance model; AsTrO taxonomy; ILP workloads; Raw microprocessor; banked caches; bypass paths; distributed architecture; energy statistics; interconnection architecture; microprocessor design; multiple independent compute pipelines; multiported register files; multiprocessor networks; scalar data transport; scalar operand networks; ultra-fast operation-operand matching; Computer industry; Counting circuits; Delay; Distributed computing; Microprocessors; Pipelines; Registers; Scalability; System recovery; Taxonomy;

fLanguage :

English

Journal_Title :

Parallel and Distributed Systems, IEEE Transactions on

Publisher :

ieee

ISSN :

1045-9219

Type :

jour

DOI :

10.1109/TPDS.2005.24

Filename :

1374855

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1197866