Title : 
Corrupted and missing predictors: Minimax bounds for high-dimensional linear regression
         
        
            Author : 
Loh, Po-Ling ; Wainwright, Martin J.
         
        
            Author_Institution : 
Dept. of Stat., Univ. of California, Berkeley, Berkeley, CA, USA
         
        
        
        
        
        
            Abstract : 
Missing and corrupted data are ubiquitous in many science and engineering domains. We analyze the information-theoretic limits of recovering sparse vectors under various models of corrupted and missing data. In particular, consider a high-dimensional linear regression model y = X β* + ϵ, where y ∈ Rn is the response vector, X ∈ RnXp is a random design matrix with p ≫ n and rows distributed i.i.d. as N(0, Σx), β* ∈ Rp is the unknown regression vector, and ϵ ~ N(0,σϵ2I) is independent additive noise. Whereas a traditional approach assumes that the covariates X are fully observed, we assume only that a corrupted version Z is observed. Our main contribution is to establish minimax rates of convergence for estimating β* in squared ℓ2-loss, assuming β* is k-sparse. Our upper and lower bounds in both additive noise and missing data cases scale as k log(p/k)/n, with prefactors depending only on the corruption and/or missing pattern of the data.
         
        
            Keywords : 
information theory; matrix algebra; regression analysis; high-dimensional linear regression model; independent additive noise; information-theoretic analysis; lower bounds; minimax bounds; random design matrix; sparse vector recovery; squared ℓ2-loss; upper bounds; Additive noise; Equations; Estimation; Linear regression; Standards; Upper bound; Vectors;
         
        
        
        
            Conference_Titel : 
Information Theory Proceedings (ISIT), 2012 IEEE International Symposium on
         
        
            Conference_Location : 
Cambridge, MA
         
        
        
            Print_ISBN : 
978-1-4673-2580-6
         
        
            Electronic_ISBN : 
2157-8095
         
        
        
            DOI : 
10.1109/ISIT.2012.6283989