DocumentCode :
3024875
Title :
Design of a software distributed shared memory system using an MPI communication layer
Author :
Ojima, Yoshinori ; Sato, Mitsuhisa ; Boku, Taisuke ; Takahashi, Daisuke
Author_Institution :
Graduate Sch. of Syst. & Inf. Eng., Tsukuba Univ., Japan
fYear :
2005
fDate :
7-9 Dec. 2005
Abstract :
We designed and implemented a software distributed shared memory (DSM) system, SCASH-MPI, by using MPI as the communication layer of the SCASH DSM. With MPI as the communication layer, we could use high-speed networks with several clusters and high portability. Furthermore, SCASH-MPI can use high-speed networks with MPI, which is the most commonly available communication library. On the other hand, existing software DSM systems usually use a dedicated communication layer, TCP, or UDP-Ethernet. SCASH-MPI avoids the need for a large amount of pin-down memory for shared memory use that has limited the applications of the original SCASH. In SCASH-MPI, a thread is created to support remote memory communication using MPI. An experiment on a 4-node Itanium cluster showed that the Laplace Solver benchmark using SCASH-MPI achieves a performance comparable to the original SCASH. Performance degradation is only 6.3% in the NPB BT benchmark Class B test. In SCASH-MPI, page transfer does not start until a page fault is detected. To hide the latency of page transmission, we implemented a prefetch function. The latency in BT Class B was reduced by 64% when the prefetch function was used.
Keywords :
distributed shared memory systems; message passing; multi-threading; 4-node Itanium cluster; Laplace Solver benchmark; MPI communication layer; NPB BT benchmark Class B test; SCASH-MPI; TCP; UDP-Ethernet; communication library; high-speed networks; page transmission; prefetch function; software distributed shared memory system design; Application software; Benchmark testing; Communication system software; Delay; High-speed networks; Prefetching; Software design; Software libraries; Software systems; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures,Algorithms and Networks, 2005. ISPAN 2005. Proceedings. 8th International Symposium on
ISSN :
1087-4089
Print_ISBN :
0-7695-2509-1
Type :
conf
DOI :
10.1109/ISPAN.2005.90
Filename :
1575830
Link To Document :
بازگشت