Title :
Efficient SVM Training Using Parallel Primal-Dual Interior Point Method on GPU
Author :
Jing Jin ; Xianggao Cai ; Xiaola Lin
Author_Institution :
Sch. of Inf. Sci. & Technol., Sun Yat-sen Univ., Guangzhou, China
Abstract :
The training of SVM can be viewed as a Convex Quadratic Programming (CQP) problem which becomes difficult to be solved when dealing with the large scale data sets. Traditional methods such as Sequential Minimal Optimization (SMO) for SVM training is used to solve a sequence of small scale sub-problems, which costs a large amount of computation time and is hard to be accelerated by utilizing the computation power of GPU. Although Interior Point Method (IPM) such as primal-dual interior point method (PDIPM) can be also addressed SVM training well and has favourable potential for parallelizing on GPU, it contains comparatively high time complexity O(l^3) and space complexity O(l^2), where l is the number of training instances. Fortunately, by invoking low-rank approximation methods such as Incomplete Cholesky Factorization (ICF) and Sherman Morrison Woodbury formula (SMW), the requirements of both storage and computation of PDIPM can be reduced significantly. In this paper, a parallel PDIPM method (P-PDIPM) along with a parallel ICF method (P-ICF) is proposed to accelerate the SVM training on GPU. Experimental results indicate that the training speed of P-PDIPM on GPU is almost 40x faster than that of the serial one (S-PDIPM) on CPU. Besides, without extensive optimization, P-PDIPM can obtain about 8x speedup over the state of the art tool LIBSVM while maintaining high prediction accuracy.
Keywords :
graphics processing units; learning (artificial intelligence); matrix decomposition; support vector machines; GPU; LIBSVM tool; P-ICF; P-PDIPM; SVM training; incomplete Cholesky factorization; parallel ICF method; parallel primal-dual interior point method; support vector machines; Algorithm design and analysis; Graphics processing units; Instruction sets; Support vector machines; Time complexity; Training; Vectors; Graphic Processing Unit; Incomplete Cholesky Factorization; Parallel Computing; Primal-Dual Interior Point Method; Sherman Morrison Woodbury Formula; Support Vector Machines;
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2013 International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4799-2418-9
DOI :
10.1109/PDCAT.2013.9