DocumentCode :
3206300
Title :
An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU
Author :
Davidson, Andrew ; Zhang, Yao ; Owens, John D.
Author_Institution :
Univ. of California, Davis, CA, USA
fYear :
2011
fDate :
16-20 May 2011
Firstpage :
956
Lastpage :
965
Abstract :
We present a multi-stage method for solving large tridiagonal systems on the GPU. Previously large tridiagonal systems cannot be efficiently solved due to the limitation of on-chip shared memory size. We tackle this problem by splitting the systems into smaller ones and then solving them on-chip. The multi-stage characteristic of our method, together with various workloads and GPUs of different capabilities, obligates an auto-tuning strategy to carefully select the switch points between computation stages. In particular, we show two ways to effectively prune the tuning space and thus avoid an impractical exhaustive search: (1) apply algorithmic knowledge to decouple tuning parameters, and (2) estimate search starting points based on GPU architecture parameters. We demonstrate that auto-tuning is a powerful tool that improves the performance by up to 5x, saves 17% and 32% of execution time on average respectively over static and dynamic tuning, and enables our multi-stage solver to outperform the Intel MKL tridiagonal solver on many parallel tridiagonal systems by 6-11x.
Keywords :
computer graphic equipment; coprocessors; GPU architecture parameters; Intel MKL tridiagonal solver; autotuned method; dynamic tuning; graphic processing unit; multistage method; on-chip shared memory size; parallel tridiagonal systems; search starting point estimation; static tuning; tuning space; Graphics processing unit; Instruction sets; Kernel; Memory management; Parallel processing; Switches; Tuning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
Conference_Location :
Anchorage, AK
ISSN :
1530-2075
Print_ISBN :
978-1-61284-372-8
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2011.92
Filename :
6012904
Link To Document :
بازگشت