• DocumentCode
    64038
  • Title

    Automatic Relevance Determination in Nonnegative Matrix Factorization with the /spl beta/-Divergence

  • Author

    Tan, Vincent Y. F. ; Fevotte, Cedric

  • Author_Institution
    Inst. for Infocomm Res., A*STAR, Singapore, Singapore
  • Volume
    35
  • Issue
    7
  • fYear
    2013
  • fDate
    Jul-13
  • Firstpage
    1592
  • Lastpage
    1605
  • Abstract
    This paper addresses the estimation of the latent dimensionality in nonnegative matrix factorization (NMF) with the β-divergence. The β-divergence is a family of cost functions that includes the squared euclidean distance, Kullback-Leibler (KL) and Itakura-Saito (IS) divergences as special cases. Learning the model order is important as it is necessary to strike the right balance between data fidelity and overfitting. We propose a Bayesian model based on automatic relevance determination (ARD) in which the columns of the dictionary matrix and the rows of the activation matrix are tied together through a common scale parameter in their prior. A family of majorization-minimization (MM) algorithms is proposed for maximum a posteriori (MAP) estimation. A subset of scale parameters is driven to a small lower bound in the course of inference, with the effect of pruning the corresponding spurious components. We demonstrate the efficacy and robustness of our algorithms by performing extensive experiments on synthetic data, the swimmer dataset, a music decomposition example, and a stock price prediction task.
  • Keywords
    algorithm theory; matrix decomposition; Bayesian model; Itakura Saito divergence; Kullback Leibler divergence; activation matrix; automatic relevance determination; cost function; data fidelity; dictionary matrix; latent dimensionality; majorization minimization algorithm; maximum a posteriori estimation; model order; music decomposition example; nonnegative matrix factorization; spurious component; squared euclidean distance; stock price prediction task; swimmer dataset; synthetic data; Algorithm design and analysis; Bayesian methods; Cost function; Data models; Linear programming; Principal component analysis; Nonnegative matrix factorization; automatic relevance determination; group-sparsity; majorization-minimization; model order selection; Algorithms; Bayes Theorem; Computer Simulation; Databases, Factual; Economics; Humans; Models, Statistical; Models, Theoretical; Pattern Recognition, Automated; Swimming;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2012.240
  • Filename
    6341758