DocumentCode :
2817678
Title :
Scaling application performance on a cache-coherent multiprocessors
Author :
Jiang, Dongming ; Singh, Jaswinder Pal
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., NJ, USA
fYear :
1999
fDate :
1999
Firstpage :
305
Lastpage :
316
Abstract :
Hardware-coherent, distributed shared address space systems are increasingly successful at moderate scale. However, it is unclear whether, or with how much difficulty, the performance of a load-store shared address space programming model scales to large processor counts on real applications. We examine this question using an aggressive case-study machine, the SGI Origin2000, up to 128 processors. We show for the first time that scalable performance can indeed be achieved in this programming model on a wide range of applications, including challenging kernels like FFT. However, this does not come easily, even for applications considered to be already highly optimized, and is very often not simply a matter of increasing problem size. Rather, substantial further application restructuring is often needed, which is usually quite algorithmic in nature. We examine how the restructurings compare with those needed for performance portability to shared virtual memory on clusters, and we comment on common programming guidelines for performance portability and scalability as well as on how the programming difficulty compares with that of explicit message passing. We also examine where applications spend their time on this large machine, the impact of special hardware features that the machine provides, and the impact of mapping to the network topology
Keywords :
message passing; multiprocessing systems; performance evaluation; virtual storage; SGI Origin2000; aggressive case-study machine; cache-coherent multiprocessors; distributed shared address space systems; load-store shared address space programming model; message passing; programming model; scaling application performance; shared virtual memory; Application software; Clustering algorithms; Computer science; Guidelines; Hardware; Memory architecture; Message passing; Network topology; Scalability; Synthetic aperture sonar;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Architecture, 1999. Proceedings of the 26th International Symposium on
Conference_Location :
Atlanta, GA
ISSN :
1063-6897
Print_ISBN :
0-7695-0170-2
Type :
conf
DOI :
10.1109/ISCA.1999.765960
Filename :
765960
Link To Document :
بازگشت