Title :
Scaling towards kilo-core processors with asymmetric high-radix topologies
Author :
Abeyratne, N. ; Das, Ratan ; Qingkun Li ; Sewell, K. ; Giridhar, B. ; Dreslinski, Ronald G. ; Blaauw, D. ; Mudge, Trevor
Author_Institution :
Adv. Comput. Archit. Lab., Univ. of Michigan, Ann Arbor, MI, USA
Abstract :
In this paper, we explore the challenges in scaling on-chip networks towards kilo-core processors. Current low-radix topologies optimize for fast local communication, but do not scale well to kilo-core systems because of the large number of routers required. These increase both power and hop count. In contrast, symmetric high-radix topologies optimize for global communication with fewer hop counts, but degrade local communication with their large, slow routers. To address both local and global communication optimizations independently, we decouple the interconnect design using asymmetric high-radix topologies. By setting a design goal of matching rauter speed with wire speed, our praposed topologies use fast medium-radix rauters to optimize for local communication and a few slow high-radix rauters that reduce hop count to optimize for global communication. Our asymmetric high-radix designs are enabled by recently praposed SwizzleSwitches, which allow us to achieve peiformance scalability within realistic power budgets. We prapose and evaluate two asymmetric high-radix topologies: Super-Star (asymmetric folded Clos) and Super-StarX (asymmetric folded Clos with superimposed mesh). Our evaluations show that the best performing asymmetric high-radix topology improves average network latency over a mesh by 45% while reducing the power consumption by 40%. When compared to symmetric high-radix topologies network thraughput is improved by 2.9× while still praviding similar latency benefits and power ejficiency.
Keywords :
digital arithmetic; integrated circuit design; microprocessor chips; multiprocessor interconnection networks; network routing; network topology; optimisation; performance evaluation; power aware computing; system-on-chip; Super-Star topologies; Super-StarX topologies; Swizzle switches; asymmetric folded Clos topologies; asymmetric high-radix designs; asymmetric high-radix topologies; fast local communication; fast medium-radix routers; global communication optimization; high-radix routers; hop count reduction; interconnect design; kilo-core processors; network latency; network routers; network throughput; on-chip network scaling; performance scalability; power budgets; power consumption reduction; power efficiency; router speed; superimposed mesh; symmetric high-radix topologies; Delays; Global communication; Network topology; Program processors; Throughput; Topology; Wires;
Conference_Titel :
High Performance Computer Architecture (HPCA2013), 2013 IEEE 19th International Symposium on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4673-5585-8
DOI :
10.1109/HPCA.2013.6522344