DocumentCode :
3316409
Title :
The parallel communication protocol in BCL-4
Author :
Zhou, Xiaocheng ; Huo, Zhigang ; Ma, Jie ; Meng, Dan
Author_Institution :
National Res. Center for Intelligent Comput. Syst., Chinese Acad. of Sci., Beijing, China
fYear :
2004
fDate :
20-22 July 2004
Firstpage :
98
Lastpage :
103
Abstract :
As CLUMPS become the main stream of clusters and the number of nodes in a cluster increases, it requires enhancing the bandwidth performance and availability of the communication system used in clusters. Parallel communication based on multiple system area networks (SANs) can fulfill the requirements. This work introduces the parallel communication protocol used in BCL-4, which is a high efficient communication system used in DAWNING-4000A, a large-scale Linux cluster. It dispatches small messages and sub-messages stripped from large messages into multiple SANs and maintains the communication semantics as before. The parallel communication process is transparent to both users and the control program on network interface card (NIC). It also provides an efficient load balance mechanism. Using the parallel communication protocol, BCL-4 provides many key features, such as multiple throughput, high availability, and backward compatibility. The experimental results show that the peak bandwidth of BCL-4 over two Myrinet is 494.7MB/s, which is almost twice of that over one, and that there is only 0.02us overhead of short message at the same time.
Keywords :
message passing; multicast protocols; network interfaces; parallel processing; resource allocation; workstation clusters; BCL-4; CLUMPS; DAWNING-4000A; communication semantics; large-scale Linux cluster; load balancing mechanism; multiple system area networks; network interface card; parallel communication protocol; Availability; Bandwidth; Communication system traffic; Computers; Intelligent systems; Large-scale systems; Parallel processing; Protocols; Storage area networks; Throughput;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing and Grid in Asia Pacific Region, 2004. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-2138-X
Type :
conf
DOI :
10.1109/HPCASIA.2004.1324022
Filename :
1324022
Link To Document :
بازگشت