DocumentCode :
2187441
Title :
Application of the OpenCL API for Implementation of the NIPALS Algorithm for Principal Component Analysis of Large Data Sets
Author :
Bowden, Joshua C.
Author_Institution :
CSIRO Inf. Manage. & Technol., Clayton, VIC, Australia
fYear :
2010
fDate :
7-10 Dec. 2010
Firstpage :
25
Lastpage :
30
Abstract :
An implementation of the nonlinear iterative partial least squares (NIPALS) algorithm was used as a test case for OpenCL computation on a general-purpose graphics processing unit (GPGPU) cluster using MPI. Timing results are presented along with a model of the time required per iteration for defined problem sizes. The steps taken to optimize the code are discussed, moving from a single GPU, to multiple GPUs on a single node, to multiple GPUs on multiple nodes. Performance comparisons are given between OpenCL and BLAS implementations and across modern CPU architectures and NVidia Tesla and Fermi class GPU systems.
Keywords :
application program interfaces; computer graphic equipment; coprocessors; iterative methods; least squares approximations; principal component analysis; BLAS; CPU architectures; Fermi class GPU systems; GPU; MPI; NIPALS; NVidia Tesla; OpenCL API; general purpose graphics processing unit; nonlinear iterative partial least squares algorithm; optimization; Algorithm design and analysis; Graphics processing unit; Instruction sets; Memory management; Pixel; Timing; Workstations; PCA
fLanguage :
English
Publisher :
IEEE
Conference_Title :
e-Science Workshops, 2010 Sixth IEEE International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4244-8988-6
Electronic_ISBN :
978-0-7695-4295-9
Type :
conf
DOI :
10.1109/eScienceW.2010.14
Filename :
5693137