DocumentCode :
2133094
Title :
DAG-consistent distributed shared memory
Author :
Blumofe, Robert D. ; Frigo, Matteo ; Joerg, Christopher E. ; Leiserson, Charles E. ; Randall, Keith H.
Author_Institution :
Lab. for Comput. Sci., MIT, Cambridge, MA, USA
fYear :
1996
fDate :
15-19 Apr 1996
Firstpage :
132
Lastpage :
141
Abstract :
Introduces DAG (directed acyclic graph) consistency, a relaxed consistency model for distributed shared memory which is suitable for multithreaded programming. We have implemented DAG consistency in software for the Cilk multithreaded runtime system running on a CM5 Connection Machine. Our implementation includes a DAG-consistent distributed cactus stack for storage allocation. We provide empirical evidence of the flexibility and efficiency of DAG consistency for applications that include blocked matrix multiplication, Strassen´s (1969) matrix multiplication algorithm and a Barnes-Hut code. Although Cilk schedules the executions of these programs dynamically, their performances are competitive with statically scheduled implementations in the literature. We also prove that the number FP of page faults incurred by a user program running an P processors can be related to the number F1 of page faults running serially by the formula FP⩽F1+2Cs, where C is the cache size and s is the number of thread migrations executed by Cilk´s scheduler
Keywords :
directed graphs; distributed memory systems; matrix multiplication; paged storage; parallel programming; processor scheduling; shared memory systems; software performance evaluation; storage allocation; Barnes-Hut code; CM5 Connection Machine; Cilk multithreaded runtime system; DAG consistency; blocked matrix multiplication; cache size; directed acyclic graph; distributed cactus stack; distributed shared memory; dynamic scheduling; efficiency; flexibility; multithreaded programming; page faults; program perfomance; relaxed consistency model; statically scheduled implementations; storage allocation; thread migrations; user program; Application software; Assembly; Computer science; Concurrent computing; Dynamic programming; Dynamic scheduling; Laboratories; Load management; Processor scheduling; Runtime; Yarn;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Processing Symposium, 1996., Proceedings of IPPS '96, The 10th International
Conference_Location :
Honolulu, HI
Print_ISBN :
0-8186-7255-2
Type :
conf
DOI :
10.1109/IPPS.1996.508049
Filename :
508049
Link To Document :
بازگشت