Title :
MIC-SVM: Designing a Highly Efficient Support Vector Machine for Advanced Modern Multi-core and Many-Core Architectures
Author :
Yang You ; Song, Shuaiwen Leon ; Haohuan Fu ; Marquez, Andres ; Dehnavi, Maryam Mehri ; Barker, Kevin ; Cameron, Kirk W. ; Randles, Amanda Peters ; Guangwen Yang
Author_Institution :
Center for Earth Sci., Tsinghua Univ., Beijing, China
Abstract :
Support Vector Machine (SVM) has been widely used in data-mining and Big Data applications as modern commercial databases start to attach an increasing importance to the analytic capabilities. In recent years, SVM was adapted to the field of High Performance Computing for power/performance prediction, auto-tuning, and runtime scheduling. However, even at the risk of losing prediction accuracy due to insufficient runtime information, researchers can only afford to apply offline model training to avoid significant runtime training overhead. Advanced multi- and many-core architectures offer massive parallelism with complex memory hierarchies which can make runtime training possible, but form a barrier to efficient parallel SVM design. To address the challenges above, we designed and implemented MIC-SVM, a highly efficient parallel SVM for x86 based multi-core and many-core architectures, such as the Intel Ivy Bridge CPUs and Intel Xeon Phi co-processor (MIC). We propose various novel analysis methods and optimization techniques to fully utilize the multilevel parallelism provided by these architectures and serve as general optimization methods for other machine learning tools. MIC-SVM achieves 4.4-84x and 18-47x speedups against the popular LIBSVM, on MIC and Ivy Bridge CPUs respectively, for several real-world data-mining datasets. Even compared with GPUSVM, run on a top of the line NVIDIA k20x GPU, the performance of our MIC-SVM is competitive. We also conduct a cross-platform performance comparison analysis, focusing on Ivy Bridge CPUs, MIC and GPUs, and provide insights on how to select the most suitable advanced architectures for specific algorithms and input data patterns.
Keywords :
Big Data; data mining; learning (artificial intelligence); microprocessor chips; multiprocessing systems; optimisation; parallel processing; scheduling; support vector machines; GPUSVM; Intel Ivy Bridge CPUs; Intel Xeon Phi coprocessor; LIBSVM; MIC-SVM; advanced manycore architectures; advanced multicore architectures; big data applications; commercial databases; data-mining applications; data-mining datasets; high performance computing; line NVIDIA k20x GPU; machine learning tools; parallel SVM design; performance prediction; runtime scheduling; support vector machine; x86 based multicore architectures; x86 manycore architectures; Bridges; Computer architecture; Equations; Microwave integrated circuits; Parallel processing; Support vector machines; Training; Multi-& Many-Core architectures; Optimization Techniques; Parallelization; Performance Analysis; Support Vector Machine;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2014 IEEE 28th International
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4799-3799-8
DOI :
10.1109/IPDPS.2014.88