Title :
Tool-assisted Optimization of Shared-memory Accesses in UPC Applications
Author :
Cong, Guojing ; Wen, Huifang ; Murata, Hiroki ; Negishi, Yasushi
Author_Institution :
IBM TJ Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
UPC is designed to improve user productivity when programming distributed-memory machines. Yet the shared-memory abstraction also makes performance analysis hard as it introduces extra overhead with local accesses and implicit communication with remote ones. As far as we know, there are no mature software utilities for systematic analysis and tuning of shared-memory access performance in UPC programs. We develop a mechanism to track shared memory accesses and correlate them to the UPC source lines, functions, and data structures. We then apply tool-assisted analysis to a set of UPC programs. For the NAS UPC benchmark we achieve dramatic performance improvement over the unoptimized implementation as well as up to two times speedups over the fully hand-tuned implementation. We expect our approach effective in tuning a wide range of UPC programs.
Keywords :
C language; data structures; distributed shared memory systems; parallel machines; parallel programming; software performance evaluation; software tools; NAS UPC benchmark; PGAS; UPC programs; UPC source lines; Unified Parallel C; data structures; distributed-memory machine programming; high performance computing systems; partitioned global address space; performance improvement; shared-memory abstraction; shared-memory access tracking; tool-assisted optimization; user productivity improvement; Arrays; Electronics packaging; Optimization; Privatization; Runtime; Tuning; GAS; UPC; performance toolsGAS; performance toolsP;
Conference_Titel :
High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems (HPCC-ICESS), 2012 IEEE 14th International Conference on
Conference_Location :
Liverpool
Print_ISBN :
978-1-4673-2164-8
DOI :
10.1109/HPCC.2012.24