DocumentCode
3015928
Title
Inherently Workload-Balanced Clustered Microarchitecture
Author
Abella, Jaume ; González, Antonio
Author_Institution
Comput. Archit. Dept., Univ. Politecnica de Catalunya, Barcelona, Spain
fYear
2005
fDate
04-08 April 2005
Abstract
The performance of clustered microarchitectures relies on steering schemes that try to find the best trade-off between workload balance and inter-cluster communication penalties. In previously proposed clustered processors, reducing communication penalties and balancing the workload are opposite targets, since improving one usually implies a detriment in the other. In this paper we propose a new clustered microarchitecture that can minimize communication penalties without compromising workload balance. The key idea is to arrange the clusters in a ring topology in such a way that results of one cluster can be forwarded to the neighbor cluster with a very short latency. In this way, minimizing communication penalties is favored when the producer of a value and its consumer are placed in adjacent clusters, which also favors workload balance. The proposed microarchitecture is shown to outperform a state-of-the-art clustered processor. For instance, for an 8-cluster configuration and just one fully pipelined unidirectional bus, 15% speedup is achieved on average for FP programs.
Keywords
parallel architectures; pipeline processing; resource allocation; inter-cluster communication penalties; pipelined unidirectional bus; state-of-the-art clustered processor; workload-balanced clustered microarchitecture; Clocks; Computer architecture; Delay effects; Energy consumption; Microarchitecture; Microprocessors; Pipelines; Process design; Topology; Wire;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN
0-7695-2312-9
Type
conf
DOI
10.1109/IPDPS.2005.258
Filename
1419837
Link To Document