DocumentCode :
64158
Title :
Efficient Model Selection for Mixtures of Probabilistic PCA Via Hierarchical BIC
Author :
Jianhua Zhao
Author_Institution :
Sch. of Stat. & Math., Yunnan Univ. of Finance & Econ., Kunming, China
Volume :
44
Issue :
10
fYear :
2014
fDate :
Oct. 2014
Firstpage :
1871
Lastpage :
1883
Abstract :
This paper concerns model selection for mixtures of probabilistic principal component analyzers (MPCA). The well known Bayesian information criterion (BIC) is frequently used for this purpose. However, it is found that BIC penalizes each analyzer implausibly using the whole sample size. In this paper, we present a new criterion for MPCA called hierarchical BIC in which each analyzer is penalized using its own effective sample size only. Theoretically, hierarchical BIC is a large sample approximation of variational Bayesian lower bound and BIC is a further approximation of hierarchical BIC. To learn hierarchical-BIC-based MPCA, we propose two efficient algorithms: two-stage and one-stage variants. The two-stage algorithm integrates model selection with respect to the subspace dimensions into parameter estimation, and the one-stage variant further integrates the selection of the number of mixture components into a single algorithm. Experiments on a number of synthetic and real-world data sets show that: 1) hierarchical BIC is more accurate than BIC and several related competitors and 2) the two proposed algorithms are not only effective but also much more efficient than the classical two-stage procedure commonly used for BIC.
Keywords :
Bayes methods; learning (artificial intelligence); parameter estimation; principal component analysis; Bayesian information criterion; hierarchical-BIC-based MPCA learning; mixtures of probabilistic PCA; mixtures of probabilistic principal component analyzers; model selection; one-stage variants; parameter estimation; subspace dimensions; two-stage variants; variational Bayesian lower bound; Analytical models; Approximation algorithms; Data models; Maximum likelihood estimation; Parameter estimation; Principal component analysis; Probabilistic logic; BIC; clustering; expectation maximization; mixture model; model selection; principal component analysis;
fLanguage :
English
Journal_Title :
Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
2168-2267
Type :
jour
DOI :
10.1109/TCYB.2014.2298401
Filename :
6714575
Link To Document :
بازگشت