DocumentCode :
2570882
Title :
A Simple Performance Model for Multithreaded Applications Executing on Non-uniform Memory Access Computers
Author :
Yang, R. ; Antony, J. ; Rendell, A.P.
Author_Institution :
Sch. of Comput. Sci., Australian Nat. Univ., Canberra, ACT, Australia
fYear :
2009
fDate :
25-27 June 2009
Firstpage :
79
Lastpage :
86
Abstract :
In this work, we extend and evaluate a simple performance model to account for NUMA and bandwidth effects in single- and multi-threaded calculations within the Gaussian 03 computational chemistry code on a contemporary multi-core NUMA platform. Using the thread and memory placement APIs in Solaris, we present results for a set of calculations from which we analyze on-chip interconnect and intra-core bandwidth contention and show the importance of load balancing between threads. The extended model predicts single-threaded performance to within 1% error and most multi-threaded experiments to within 15% error. Our results and modeling show that accounting for bandwidth constraints within user-space code is beneficial.
Keywords :
application program interfaces; chemistry computing; multi-threading; resource allocation; shared memory systems; Solaris memory placement API; Solaris thread placement API; computational chemistry code; intracore bandwidth contention; load balancing; multicore NUMA platform; multithreaded application; nonuniform memory access computer; on-chip interconnect; performance model; Analytical models; Application software; Bandwidth; Chemistry; Computer science; Hardware; High performance computing; Predictive models; Supercomputers; Yarn; NUMA; performance modelling;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
High Performance Computing and Communications, 2009. HPCC '09. 11th IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4244-4600-1
Electronic_ISBN :
978-0-7695-3738-2
Type :
conf
DOI :
10.1109/HPCC.2009.39
Filename :
5166979