DocumentCode
1918494
Title
Abstract: Memory and Parallelism Exploration Using the LULESH Proxy Application
Author
Karlin, Ian ; McGraw, Jim ; Gallardo, Esthela ; Keasler, Jeff ; Leon, Edgar A. ; Still, Bert
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
1427
Lastpage
1428
Abstract
Current and planned computer systems present challenges for scientific programming. Memory capacity and bandwidth are limiting performance as floating point capability increases due to more cores per processor and wider vector units. Effectively using hardware requires finding greater parallelism in programs while using relatively less memory. In this poster, we present how we tuned the Livermore Unstructured Lagrange Explicit Shock Hydrodynamics proxy application for on-node performance resulting in 62% fewer memory reads, a 19% smaller memory footprint, 770% more floating point operations vectorizing and less than 0.1% serial section runtime. Tests show serial code version runtime decreases of up to 57% and parallel runtime reductions of up to 75%. We are also applying these optimizations to GPUs and a subset of ALE3D, from which the proxy application was derived. So far we achieve up to a 1.9x speedup on GPUs, and a 13% runtime reduction in the application for the same problem.
Keywords
Data Motion Minimization; Performance Tuning; Proxy-app;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SCC), 2012 SC Companion:
Conference_Location
Salt Lake City, UT
Print_ISBN
978-1-4673-6218-4
Type
conf
DOI
10.1109/SC.Companion.2012.234
Filename
6496017
Link To Document