Title :
Manifold Learning and Missing Data Recovery through Unsupervised Regression
Author :
Carreira-Perpinan, Miguel A. ; Lu, Zhengdong
Author_Institution :
EECS, Univ. of California, Merced, CA, USA
Abstract :
We propose an algorithm that, given a high-dimensional dataset with missing values, achieves the distinct goals of learning a nonlinear low-dimensional representation of the data (the dimensionality reduction problem) and reconstructing the missing high-dimensional data (the matrix completion, or imputation, problem). The algorithm follows the Dimensionality Reduction by Unsupervised Regression approach, where one alternately optimizes over the latent coordinates given the reconstruction and projection mappings, and vice versa, but here we also optimize over the missing data, using an efficient, globally convergent Gauss-Newton scheme. We also show how to project or reconstruct test data with missing values. We achieve impressive reconstructions while learning good latent representations in image restoration with 50% missing pixels.
Keywords :
Gaussian processes; data reduction; image restoration; matrix algebra; regression analysis; unsupervised learning; convergent Gauss-Newton scheme; dimensionality reduction; high-dimensional datasets; image restoration; manifold learning; missing data recovery; missing high-dimensional data reconstruction; nonlinear low-dimensional representation; projection mappings; unsupervised regression approach; Image reconstruction; Jacobian matrices; Linear systems; Manifolds; Optimization; Training; Vectors; dimensionality reduction; manifold learning; matrix completion; missing data;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.97