Title :
The topographic organization and visualization of binary data using multivariate-Bernoulli latent variable models
Author_Institution :
Div. of Comput. & Inf. Syst., Univ. of Paisley, UK
fDate :
11/1/2001 12:00:00 AM
Abstract :
A nonlinear latent variable model for the topographic organization and subsequent visualization of multivariate binary data is presented. The generative topographic mapping (GTM) is a nonlinear factor analysis model for continuous data which assumes an isotropic Gaussian noise model and performs uniform sampling from a two-dimensional (2-D) latent space. Despite the, success of the GTM when applied to continuous data the development of a similar model for discrete binary data has been hindered due, in part, to the nonlinear link function inherent in the binomial distribution which yields a log-likelihood that is nonlinear in the model parameters. The paper presents an effective method for the parameter estimation of a binary latent variable model-a binary version of the GTM-by adopting a variational approximation to the binomial likelihood. This approximation thus provides a log-likelihood which is quadratic in the model parameters and so obviates the necessity of an iterative M-step in the expectation maximization (EM) algorithm. The power of this method is demonstrated on two significant application domains, handwritten digit recognition and the topographic organization of semantically similar text-based documents
Keywords :
data visualisation; matrix algebra; parameter estimation; probability; self-organising feature maps; unsupervised learning; binary data; binary latent variable model; binomial likelihood; continuous data; data visualization; generative topographic mapping; handwritten digit recognition; isotropic Gaussian noise model; multivariate-Bernoulli latent variable models; nonlinear factor analysis model; nonlinear latent variable model; parameter estimation; semantically similar text-based documents; topographic organization; uniform sampling; variational approximation; Approximation algorithms; Clustering algorithms; Data visualization; Gaussian noise; Parameter estimation; Performance analysis; Principal component analysis; Sampling methods; Two dimensional displays; Unsupervised learning;
Journal_Title :
Neural Networks, IEEE Transactions on