• DocumentCode
    451150
  • Title

    Architectural Requirements and Scalability of the NAS Parallel Benchmarks

  • Author

    Wong, Frederick C. ; Martin, Richard P. ; Arpaci-Dusseau, Remzi H. ; Culler, David E.

  • Author_Institution
    University of California, Berkeley
  • fYear
    1999
  • fDate
    13-18 Nov. 1999
  • Firstpage
    41
  • Lastpage
    41
  • Abstract
    We present a study of the architectural requirements and scalability of the NAS Parallel Benchmarks. Through direct measurements and simulations, we identify the factors which affect the scalability of benchmark codes on two relevant and distinct platforms; a cluster of workstations and a ccNUMA SGI Origin 2000. We find that the benefit of increased global cache size is pronounced in certain applications and often offsets the communication cost. By constructing the working set profile of the benchmarks, we are able to visualize the improvement of computational efficiency under constant-problem-size scaling. We also find that, while the Origin MPI has better point-to-point performance, the cluster MPI layer is more scalable with communication load. However, communication performance within the applications is often much lower than what would be achieved by micro-benchmarks. We show that the communication protocols used by MPI runtime library are influential to the communication performance in applications, and that the benchmark codes have a wide spectrum of communication requirements.
  • Keywords
    Computational efficiency; Computational modeling; Computer science; Costs; Parallel machines; Protocols; Runtime library; Scalability; Visualization; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, ACM/IEEE 1999 Conference
  • Print_ISBN
    1-58113-091-0
  • Type

    conf

  • DOI
    10.1109/SC.1999.10044
  • Filename
    1592684