Title :
Implementation and performance evaluation of Locust
Author :
Verma, Manish ; Chiueh, Tzi-cker
Author_Institution :
Silicon Graphics Comput. Syst., Mountain View, CA, USA
Abstract :
Locust is a distributed shared virtual memory system that exploits compile-time data dependency information to address the issues of false sharing, cache coherence overhead, and affinity process scheduling. The paper reports and analyzes the results of a comprehensive performance evaluation of the first Locust prototype, which is implemented on a 12-node Pentium cluster running FreeBSD and has been fully operational for a year. The results show that, for the set of regular programs tested, the performance of Locust is within 1-8% of that of the message passing system implemented on the same hardware/software platform. The performance gain of Locust over existing weak cache consistency models comes mainly from the elimination of unnecessary synchronizations using the generational cache coherence protocol, and from the function-shipping approach to implementing synchronization operations.
Keywords :
cache storage; distributed memory systems; parallel programming; processor scheduling; protocols; shared memory systems; software performance evaluation; synchronisation; virtual storage; FreeBSD; Locust; Pentium cluster; affinity process scheduling; cache coherence overhead; compile-time data dependency information; distributed shared virtual memory system; false sharing; function-shipping approach; generational cache coherence protocol; message passing system; performance evaluation; synchronization; weak cache consistency models; Coherence; Hardware; Message passing; Performance analysis; Performance gain; Prototypes; Software performance; Software prototyping; Software testing; System testing;
Conference_Title :
Parallel Processing, 1998. Proceedings. 1998 International Conference on
Conference_Location :
Minneapolis, MN
Print_ISBN :
0-8186-8650-2
DOI :
10.1109/ICPP.1998.708468