Title :
A New Look at Compressed Ordinary Least Squares
Author_Institution :
Sch. of Comput. Sci., Univ. of Birmingham, Birmingham, UK
Abstract :
The prospect of carrying out data mining on cheaply compressed versions of high dimensional massive data sets holds tremendous potential and promise. However, our understanding of the performance guarantees available from such computationally inexpensive dimensionality reduction methods for data mining and machine learning tasks is currently lagging behind the requirements. In this paper we take a new look at randomly projected ordinary least squares regression, and give improved bounds on its expected excess risk. Our bounds are derived from first principles and use elementary techniques.
Keywords :
data compression; data mining; learning (artificial intelligence); least squares approximations; regression analysis; cheaply compressed data; compressed ordinary least squares; computationally inexpensive dimensionality reduction methods; data mining; elementary techniques; first principles; high dimensional massive data sets; machine learning; randomly projected ordinary least squares regression; Aerospace electronics; Compressed sensing; Data mining; Linear regression; Matrix decomposition; Sparse matrices; Vectors; OLS regression; Random Projections; excess risk bounds;
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
DOI :
10.1109/ICDMW.2013.152