Title :
OpenCL implementation of Cholesky matrix decomposition
Author :
Brunelli, Claudio ; Aho, Eero ; Berg, Heikki
Author_Institution :
Nokia Res. Center, Tampere, Finland
fDate :
Oct. 31 2011-Nov. 2 2011
Abstract :
This paper presents some OpenCL implementations for Cholesky decomposition, a very popular algorithm used in linear algebra and signal processing applications. The Cholesky algorithm represents a very interesting candidate for OpenCL implementation since it contains sequential parts besides parallel ones. Furthermore, one step involves just a small amount of calculations. These characteristics pose challenges which call for suitable techniques to overcome the limitations of the language. We propose several versions of the implementation of the Cholesky algorithm, then provide an analysis of the trade off between complexity and performance offered by each of them. We also analyze the differences between execution of the program on GPU and on multicore CPU.
Keywords :
computer graphic equipment; coprocessors; mathematics computing; matrix decomposition; multiprocessing systems; Cholesky matrix decomposition; GPU; OpenCL implementation; linear algebra; multicore CPU; signal processing applications; Algorithm design and analysis; Graphics processing unit; Kernel; Matrix decomposition; Multicore processing; Performance evaluation; Synchronization;
Conference_Titel :
System on Chip (SoC), 2011 International Symposium on
Conference_Location :
Tampere
Print_ISBN :
978-1-4577-0671-4
Electronic_ISBN :
978-1-4577-0670-7
DOI :
10.1109/ISSOC.2011.6089694