Title :
Message passing for Linux clusters with gigabit Ethernet mesh connections
Author :
Chen, Jie ; Watson, William, III ; Edwards, Robert ; Mao, Weizhen
Author_Institution :
HPC Group, Jefferson Lab., Newport News, VA, USA
Abstract :
Multiple copper-based commodity gigabit Ethernet (GigE) interconnects (adapters) on a single host can lead to Linux clusters with mesh/torus connections without using expensive switches and high speed network interconnects (NICs). However traditional message passing systems based on TCP for GigE cannot perform well for this type of clusters because of the overhead of TCP for multiple GigE links. In this paper, we present two os-bypass message passing systems that are based on a modified M-VIA (an implementation of VIA specification) for two production GigE mesh clusters: one is constructed as a 4x8x8 (256 nodes) torus and has been in production use for a year; the other is constructed as a 6×8×8 (384 nodes) torus and was deployed recently. One of the message passing systems targets to a specific application domain and is called QMP and the other is an implementation of MPI specification 1.1. The GigE mesh clusters using these two message passing systems achieve about 18.5 μs half-way round trip latency and 400MB/s total bandwidth, which compare reasonably well to systems using specialized high speed adapters in a switched architecture at much lower costs.
Keywords :
LAN interconnection; Linux; application program interfaces; formal specification; message passing; network interfaces; transport protocols; workstation clusters; Linux cluster; M-VIA specification; MPI specification; TCP; copper-based commodity gigabit Ethernet interconnect; high speed network interconnect; mesh connection; message passing system; switched architecture; Bandwidth; Communication switching; Computer architecture; Delay; Ethernet networks; Linux; Message passing; Network topology; Production systems; Switches;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
DOI :
10.1109/IPDPS.2005.287