DocumentCode :
2457721
Title :
OpenCL implementation of Cholesky matrix decomposition
Author :
Brunelli, Claudio ; Aho, Eero ; Berg, Heikki
Author_Institution :
Nokia Res. Center, Tampere, Finland
fYear :
2011
fDate :
Oct. 31 2011-Nov. 2 2011
Firstpage :
62
Lastpage :
67
Abstract :
This paper presents some OpenCL implementations for Cholesky decomposition, a very popular algorithm used in linear algebra and signal processing applications. The Cholesky algorithm represents a very interesting candidate for OpenCL implementation since it contains sequential parts besides parallel ones. Furthermore, one step involves just a small amount of calculations. These characteristics pose challenges which call for suitable techniques to overcome the limitations of the language. We propose several versions of the implementation of the Cholesky algorithm, then provide an analysis of the trade off between complexity and performance offered by each of them. We also analyze the differences between execution of the program on GPU and on multicore CPU.
Keywords :
computer graphic equipment; coprocessors; mathematics computing; matrix decomposition; multiprocessing systems; Cholesky matrix decomposition; GPU; OpenCL implementation; linear algebra; multicore CPU; signal processing applications; Algorithm design and analysis; Graphics processing unit; Kernel; Matrix decomposition; Multicore processing; Performance evaluation; Synchronization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System on Chip (SoC), 2011 International Symposium on
Conference_Location :
Tampere
Print_ISBN :
978-1-4577-0671-4
Electronic_ISBN :
978-1-4577-0670-7
Type :
conf
DOI :
10.1109/ISSOC.2011.6089694
Filename :
6089694
Link To Document :
بازگشت