DocumentCode :
2787454
Title :
Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications
Author :
Ma, Wenjing ; Krishnamoorthy, Sriram ; Agrawal, Gagan
Author_Institution :
Pacific Northwest Nat. Lab., Richland, WA, USA
fYear :
2011
fDate :
10-14 Oct. 2011
Firstpage :
181
Lastpage :
182
Abstract :
Auto-tuning has emerged as an important practical method for creating highly optimized code. However, the growing complexity of architectures and applications has resulted in a prohibitively large search space that preclude empirical auto-tuning. Here, we focus on the challenge to auto-tuning presented by applications that require auto-tuning of not just a small number of distinct kernels, but a large number of kernels that exhibit similar computation and memory access characteristics and require optimization over similar problem spaces. We propose an auto-tuning method for tensor contraction functions on GPUs, based on parameterized micro-benchmarks. Using our parameterized micro-benchmarking approach, we obtain a speedup of up to 2 over the version that used default optimizations without auto-tuning.
Keywords :
benchmark testing; graphics processing units; optimisation; GPU; autotuning approach; optimization; parameterized microbenchmarking; tensor contraction functions; Computer architecture; Graphics processing unit; Indexes; Kernel; Optimization; Tensile stress; Tiles; GPU; auto-tuning; optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel Architectures and Compilation Techniques (PACT), 2011 International Conference on
Conference_Location :
Galveston, TX
ISSN :
1089-795X
Print_ISBN :
978-1-4577-1794-9
Type :
conf
DOI :
10.1109/PACT.2011.30
Filename :
6113805
Link To Document :
بازگشت