DocumentCode
1243202
Title
Mesh-of-Trees and Alternative Interconnection Networks for Single-Chip Parallelism
Author
Balkan, Aydin O. ; Qu, Gang ; Vishkin, Uzi
Author_Institution
Electr. & Comput. Eng. Dept., Univ. of Maryland, College Park, MD, USA
Volume
17
Issue
10
fYear
2009
Firstpage
1419
Lastpage
1432
Abstract
In single-chip parallel processors, it is crucial to implement a high-throughput low-latency interconnection network to connect the on-chip components, especially the processing units and the memory units. In this paper, we propose a new mesh of trees (MoT) implementation of the interconnection network and evaluate it relative to metrics such as wire complexity, total register count, single switch delay, maximum throughput, tradeoffs between throughput and latency, and post-layout performance. We show that on-chip interconnection networks can provide higher bandwidth between processors and shared first-level cache than previously considered possible, facilitating greater scalability of memory architectures that require that. MoT is also compared, both analytically and experimentally, to some other traditional network topologies, such as hypercube, butterfly, fat trees and butterfly fat trees. When we evaluate a 64-terminal MoT network at 90-nm technology, concrete results show that MoT provides higher throughput and lower latency especially when the input traffic (or the on-chip parallelism) is high, at comparable area. A recurring problem in networking and communication is that of achieving good sustained throughput in contrast to just high theoretical peak performance that does not materialize for typical work loads. Our quantitative results demonstrate a clear advantage of the proposed MoT network in the context of single-chip parallel processing.
Keywords
multiprocessor interconnection networks; network-on-chip; parallel processing; high-throughput low-latency interconnection network; memory units; mesh-of-trees; network topologies; on-chip interconnection network; shared first-level cache; single switch delay; single-chip parallel processor; single-chip parallelism; size 90 nm; wire complexity; Layout; mesh-of-trees; multiprocessor interconnection; multistage interconnection networks; network-on-chip (NoC); parallel processing; topology;
fLanguage
English
Journal_Title
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on
Publisher
ieee
ISSN
1063-8210
Type
jour
DOI
10.1109/TVLSI.2008.2003999
Filename
4815529
Link To Document