Title :
Parallel tree building on a range of shared address space multiprocessors: algorithms and application performance
Author :
Shan, Hongzhang ; Singh, Jaswinder Pal
Author_Institution :
Dept. of Comput. Sci., Princeton Univ., NJ, USA
fDate :
30 Mar-3 Apr 1998
Abstract :
Irregular, particle-based applications that use trees, for example hierarchical N-body applications, are important consumers of multiprocessor cycles, and are argued to benefit greatly in programming ease from a coherent shared address space programming model. As more and more supercomputing platforms that can support different programming models become available to users, from tightly-coupled hardware-coherent machines to clusters of workstations or SMPs, to truly deliver on its ease of programming advantages to application users it is important that the shared address space model not only perform and scale well in the tightly-coupled case but also port well in performance across the range of platforms (as the message passing model can). For tree-based N-body applications, this is currently not true: while the actual computation of interactions ports well, the parallel tree building phase can become a severe bottleneck on coherent shared address space platforms, in particular on platforms with less aggressive, commodity-oriented communication architectures (even though it takes less 3 percent of the time in most sequential executions). The authors therefore investigate the performance of five parallel tree building methods in the context of a complete galaxy simulation on four very different platforms that support this programming model
Keywords :
parallel algorithms; parallel architectures; parallel machines; parallel programming; shared memory systems; software performance evaluation; trees (mathematics); algorithms; application performance; coherent shared address space platforms; coherent shared address space programming model; commodity-oriented communication architectures; complete galaxy simulation; hierarchical N-body applications; interaction computation; irregular particle-based applications; multiprocessor cycles; parallel tree building; programming ease; shared address space multiprocessors; supercomputing platforms; tightly-coupled hardware-coherent machines; trees; workstation clusters; Application software; Buildings; Coherence; Context modeling; Dynamic programming; Hardware; Message passing; Partitioning algorithms; Protocols; Workstations;
Conference_Titel :
Parallel Processing Symposium, 1998. IPPS/SPDP 1998. Proceedings of the First Merged International ... and Symposium on Parallel and Distributed Processing 1998
Conference_Location :
Orlando, FL
Print_ISBN :
0-8186-8404-6
DOI :
10.1109/IPPS.1998.669959