DocumentCode
2187441
Title
Application of the OpenCL API for Implementation of the NIPALS Algorithm for Principal Component Analysis of Large Data Sets
Author
Bowden, Joshua C.
Author_Institution
CSIRO Inf. Manage. & Technol., Clayton, VIC, Australia
fYear
2010
fDate
7-10 Dec. 2010
Firstpage
25
Lastpage
30
Abstract
An implementation of the nonlinear iterative partial least squares (NIPALS) algorithm was used as a test case for OpenCL computation on a general-purpose graphics processing unit (GPGPU) cluster using MPI. Timing results are shown, along with results from a model of the time required per iteration for defined problem sizes. The steps taken to optimize the code are discussed, moving from a single GPU, to multiple GPUs on a single node, to multiple GPUs on multiple nodes. Performance is compared across OpenCL and BLAS implementations, modern CPU architectures, and NVIDIA Tesla and Fermi class GPU systems.
Keywords
application program interfaces; computer graphic equipment; coprocessors; iterative methods; least squares approximations; principal component analysis; BLAS; CPU architectures; Fermi class GPU systems; GPU; MPI; NIPALS; NVidia Tesla; OpenCL API; general purpose graphics processing unit; nonlinear iterative partial least squares algorithm; optimization; Algorithm design and analysis; Graphics processing unit; Instruction sets; Memory management; Pixel; Timing; Workstations; PCA
fLanguage
English
Publisher
IEEE
Conference_Titel
e-Science Workshops, 2010 Sixth IEEE International Conference on
Conference_Location
Brisbane, QLD
Print_ISBN
978-1-4244-8988-6
Electronic_ISBN
978-0-7695-4295-9
Type
conf
DOI
10.1109/eScienceW.2010.14
Filename
5693137