Title :
Myrinet express (MX): Is your interconnect smart ?
Author :
Geoffray, Patrick
Author_Institution :
Myricom, Inc., Arcadia, CA, USA
Abstract :
Ever since the first prototypes more than ten years ago, the high-performance interconnect Myrinet has been designed around a revolutionizing concept: scalability lives on the edges. In other words, the switching element of the fabric must be kept very simple and the processing power moved to the network interfaces (NICs). Myrinet NICs are built around a programmable RISC processor that operates directly in the data path. Myrinet express (MX), a new communication layer, was specifically designed to harness this capacity of customizing the behavior of the hardware in order to improve the performance of common software interfaces such as MPI and Ethernet emulation. MX embeds functions into the NIC to assist and improve the communication protocols on the host, while providing a very short critical path. MX opportunistically runs part of the host protocol closest to the data path, decoupling the progression of the protocol from the execution of the host application. The results show not only an impressive low-level point-to-point performance, but also a sharp improvement of higher-level metrics such as overlapping of communication, computation, and collective communication for MPI. Similar results are seen for high-throughput, low-overhead IP communication. We present the overall design of MX, along with the key technical choices we made and the problems we encountered. We also present various results to show the impact on benchmarks and real world HPC applications.
Keywords :
network interfaces; program processors; reduced instruction set computing; Ethernet emulation; MPI; Myrinet NIC; Myrinet express; common software interfaces; communication layer; communication protocols; critical path; data path; high-performance interconnect; high-throughput low-overhead IP communication; network interfaces; processing power; programmable RISC processor; real world HPC applications; switching element; Communication switching; Ethernet networks; Fabrics; Hardware; Network interfaces; Protocols; Prototypes; Reduced instruction set computing; Scalability; Software performance;
Conference_Titel :
High Performance Computing and Grid in Asia Pacific Region, 2004. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-2138-X
DOI :
10.1109/HPCASIA.2004.1324074