DocumentCode :
3053216
Title :
Accelerating on-line training of LS-SVM with run-time reconfiguration
Author :
Wang, Shaojun ; Peng, Yu ; Zhao, Guangquan ; Peng, Xiyuan
Author_Institution :
Sch. of Electr. Eng. & Autom., Harbin Inst. of Technol., Harbin, China
fYear :
2011
fDate :
12-14 Dec. 2011
Firstpage :
1
Lastpage :
6
Abstract :
Least Squares Support Vector Machines(LS-SVM), which is an efficient supervised learning tool, has been widely applied to real-time on-line data processing in many fields. However, the on-line training of LS-SVM always suffers from huge computation which greatly limits its practicability especially in embedded systems. By leveraging the flexibility and high degree parallelism offered by reconfigurable fabrics, we propose a Run-Time Reconfiguration(RTR) framework to accelerate the on-line training of LS-SVM. To realize maximum computational parallelism, we divide the training process into two parts, the kernel matrix formulation and the least-square problem solving. We dynamically load these two parts into FPGA with RTR under the control of the embedded PowerPC. In the kernel matrix formulation part, we design a piecewise linear interpolation method to realize the radial basis function. In the least-square problem solving part, the modified Cholesky Decomposition is introduced to avoid the latency caused by square roots operations. The whole design is tested on Virtex XC5VFX130T with a 150MHz clock. The experiments show appealing speed up which ranges from 6~218× over a Xeon CPU implementation on five different sized datasets. From time cost percentage analysis, our proposed architecture can be effectively applied to LS-SVM training in more than 1000 samples applications.
Keywords :
data mining; embedded systems; field programmable gate arrays; learning (artificial intelligence); least squares approximations; matrix algebra; piecewise linear techniques; support vector machines; training; FPGA; LS-SVM; RTR; Virtex XC5VFX130T; Xeon CPU implementation; embedded PowerPC; embedded system; kernel matrix formulation; least squares support vector machine; maximum computational parallelism; modified Cholesky decomposition; online training; piecewise linear interpolation method; radial basis function; real time online data processing; reconfigurable fabrics; run-time reconfiguration framework; supervised learning tool; Computer architecture; Field programmable gate arrays; Hardware; Kernel; Matrix decomposition; Random access memory; Training; Custom Computing; FPGA; LS-SVM; Run Time Reconfiguration;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Field-Programmable Technology (FPT), 2011 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4577-1741-3
Type :
conf
DOI :
10.1109/FPT.2011.6132697
Filename :
6132697
Link To Document :
بازگشت