Title :
Large-Scale Nyström Kernel Matrix Approximation Using Randomized SVD
Author :
Mu Li ; Wei Bi ; Kwok, J.T. ; Bao-Liang Lu
Author_Institution :
Dept. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
The Nyström method is an efficient technique for the eigenvalue decomposition of large kernel matrices. However, to ensure an accurate approximation, a sufficient number of columns have to be sampled. On very large data sets, the singular value decomposition (SVD) step on the resultant data submatrix can quickly dominate the computations and become prohibitive. In this paper, we propose an accurate and scalable Nyström scheme that first samples a large column subset from the input matrix, but then only performs an approximate SVD on the inner submatrix using the recent randomized low-rank matrix approximation algorithms. Theoretical analysis shows that the proposed algorithm is as accurate as the standard Nyström method that directly performs a large SVD on the inner submatrix. On the other hand, its time complexity is only as low as performing a small SVD. Encouraging results are obtained on a number of large-scale data sets for low-rank approximation. Moreover, as the most computational expensive steps can be easily distributed and there is minimal data transfer among the processors, significant speedup can be further obtained with the use of multiprocessor and multi-GPU systems.
Keywords :
eigenvalues and eigenfunctions; graphics processing units; matrix algebra; operating system kernels; singular value decomposition; Nystrom kernel matrix approximation; Nystrom scheme; eigenvalue decomposition; inner submatrix; large kernel matrices; low rank approximation; minimal data transfer; multiGPU systems; multiprocessor; processors; randomized SVD; randomized low rank matrix approximation algorithms; resultant data submatrix; singular value decomposition; standard Nystrom method; Algorithm design and analysis; Approximation algorithms; Approximation methods; Kernel; Matrix decomposition; Standards; Time complexity; Distributed computing; Nyström method; Nystr??m method; graphics processor; large-scale learning; low-rank matrix approximation; randomized SVD; randomized SVD.;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2014.2359798