Title :
High-dimensional analysis of semidefinite relaxations for sparse principal components
Author :
Amini, Arash A. ; Wainwright, Martin J.
Author_Institution :
Dept. of EECS, UC Berkeley, Berkeley, CA
Abstract :
In problem of sparse principal components analysis (SPCA), the goal is to use n i.i.d. samples to estimate the leading eigenvector(s) of a p times p covariance matrix, which are known a priori to be sparse, say with at most k non-zero entries. This paper studies SPCA in the high-dimensional regime, where the model dimension p, sparsity index k, and sample size n all tend to infinity simultaneously. We first analyze a simple and computationally inexpensive diagonal cut-off method, and establish a threshold of the order thetasdiag = n/[k2 log(p-k)] separating success from failure. We then analyze a more complex semidefinite programming (SDP) relaxation due to dpsilaAspremont et al., and prove that it succeeds once the sample size is of the order thetassdp = n/[k log(p - k)]. Our results thus highlight an interesting tradeoff between statistical and computational efficiency in high-dimensional estimation problems.
Keywords :
covariance matrices; eigenvalues and eigenfunctions; mathematical programming; principal component analysis; covariance matrix; eigenvector; relaxation; semidefinite programming; sparse principal components analysis; Compressed sensing; Computational efficiency; Covariance matrix; Eigenvalues and eigenfunctions; Electric breakdown; Failure analysis; H infinity control; Principal component analysis; Relaxation methods; Statistical analysis;
Conference_Titel :
Information Theory, 2008. ISIT 2008. IEEE International Symposium on
Conference_Location :
Toronto, ON
Print_ISBN :
978-1-4244-2256-2
Electronic_ISBN :
978-1-4244-2257-9
DOI :
10.1109/ISIT.2008.4595432