Title :
Custom Built Heterogeneous Multi-core Architectures (CUBEMACH): Breaking the conventions
Author :
Venkateswaran, Nagarajan ; Saravanan, Karthikeyan Palavedu ; Nachiappan, Nachiappan Chidambaram ; Vasudevan, Aravind ; Subramaniam, Balaji ; Mukundarajan, Ravindhiran
Author_Institution :
Waran Res. Found., Chennai, India
Abstract :
Increasing computational demand has stirred node architectures to move towards SuperComputer-On-Chips(SCOCs), where computational efficiency is emphasized over peak performance. Suitability of the architecture to a wider class of applications is becoming an pre-eminent design constraint for future HPC systems. This paper explores a novel design paradigm, the Custom Built Heterogeneous Multi-core Architectures (CUBEMACH) for realizing future generation node architectures. The CUBEMACH design flavoring a set of applications offers the possibility of increased resource utilization, which is exploited by running traces of multiple independent applications within a node without time or space sharing. A wide variety of complex Algorithm Level Functional units (ALFUs) besides scalars are used to meet high performance requirements of the grand challenge applications. To cater to the high communication bandwidth requirements across heterogeneous cores comprising these Algorithm Level Functional Units, a novel hierarchical communication backbone structure referred to as the On-Node-Network (ONNET) is used. The demand for high instruction issue rate due to the presence of a large number Algorithm Level Functional Units is catered by an Hardware based Compiler-On-Silicon(COS). The cost-effectiveness is achieved due to the the fact that the CUBEMACH design paradigm helps create an architecture for a single user for executing multiple independent applications without space time sharing.The cost effectiveness of implementing the CUBEMACH Design Paradigm is also achieved by developing SCOC IP cores for Higher Level Functional Us, Compiler-On-Silicon and On-Node-Network.
Keywords :
computer architecture; microprocessor chips; multiprocessing systems; resource allocation; CUBEMACH design; HPC system; SCOC IP core; algorithm level functional units; communication bandwidth requirement; computational efficiency; cost-effectiveness; custom built heterogeneous multicore architecture; design paradigm; future generation node architecture; hardware based compiler-on-silicon; hierarchical communication backbone structure; on-node-network; resource utilization; supercomputer-on-chips; Application software; Bandwidth; Brain modeling; Computational efficiency; Computer architecture; Costs; Hardware; Parallel processing; Resource management; Spine; Algorithm Level Functional Units (HLFU); CUBEMACH design; Compiler-On-Silicon; On Node Network (ONNET); SCOC IP Cores; SuperComputer-On-Chip (SCOC);
Conference_Titel :
Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-6533-0
DOI :
10.1109/IPDPSW.2010.5470871