DocumentCode :
2984887
Title :
Parallelization with Multiplicative Algorithms for Big Data Mining
Author :
Dijun Luo ; Ding, Chibiao ; Heng Huang
Author_Institution :
Comput. Sci. & Eng. Dept., Univ. of Texas at Arlington, Arlington, TX, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
489
Lastpage :
498
Abstract :
We propose a nontrivial strategy to parallelize a series of data mining and machine learning problems, including 1-class and 2-class support vector machines, nonnegative least square problems, and $ell_1$ regularized regression (LASSO) problems. Our strategy fortunately leads to extremely simple multiplicative algorithms which can be straightforwardly implemented in parallel computational environments, such as Map Reduce, or CUDA. We provide rigorous analysis of the correctness and convergence of the algorithm. We demonstrate the scalability and accuracy of our algorithms in comparison with other current leading algorithms.
Keywords :
data mining; learning (artificial intelligence); regression analysis; support vector machines; 1-class support vector machine; 2-class support vector machine; CUDA; LASSO problem; Map Reduce; data mining; machine learning problem; multiplicative algorithm; nonnegative least square problem; nontrivial strategy; parallel computational environment; regularized regression; Algorithm design and analysis; Convergence; Data mining; Graphics processing units; Machine learning algorithms; Optimization; Support vector machines; Big Data; CUDA; LASSO; MapReduce; Support Vector Machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.155
Filename :
6413876
Link To Document :
بازگشت