Title :
Simple Exponential Family PCA
Author :
Jun Li ; Dacheng Tao
Author_Institution :
Centre for Quantum Comput. & Intell. Syst., Univ. of Technol., Sydney, NSW, Australia
Abstract :
Principal component analysis (PCA) is a widely used model for dimensionality reduction. In this paper, we address the problem of determining the intrinsic dimensionality of a general type data population by selecting the number of principal components for a generalized PCA model. In particular, we propose a generalized Bayesian PCA model, which deals with general type data by employing exponential family distributions. Model selection is realized by empirical Bayesian inference of the model. We name the model as simple exponential family PCA (SePCA), since it embraces both the principal of using a simple model for data representation and the practice of using a simplified computational procedure for the inference. Our analysis shows that the empirical Bayesian inference in SePCA formally realizes an intuitive criterion for PCA model selection - a preserved principal component must sufficiently correlate to data variance that is uncorrelated to the other principal components. Experiments on synthetic and real data sets demonstrate effectiveness of SePCA and exemplify its characteristics for model selection.
Keywords :
belief networks; inference mechanisms; principal component analysis; SePCA; data representation; data variance; empirical Bayesian inference; exponential family distributions; general-type data population intrinsic dimensionality determination; generalized Bayesian PCA model; intuitive criterion; principal component analysis; principal component selection; real data sets; simple exponential family PCA; synthetic data sets; Bayesian methods; Complexity theory; Computational modeling; Data models; Estimation; Principal component analysis; Probabilistic logic; Automatic relevance determination; dimensionality reduction; exponential family PCA;
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
DOI :
10.1109/TNNLS.2012.2234134