• DocumentCode
    47768
  • Title

    Bayesian Nonparametric Models for Multiway Data Analysis

  • Author

    Xu, Zongben ; Yan, Fengping ; Qi, Yaoyao

  • Author_Institution
    School of Computer Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
  • Volume
    37
  • Issue
    2
  • fYear
    2015
  • fDate
    Feb. 2015
  • Firstpage
    475
  • Lastpage
    487
  • Abstract
    Tensor decomposition is a powerful computational tool for multiway data analysis. Many popular tensor decomposition approaches—such as the Tucker decomposition and CANDECOMP/PARAFAC (CP)—amount to multi-linear factorization. They are insufficient to model (i) complex interactions between data entities, (ii) various data types (e.g., missing data and binary data), and (iii) noisy observations and outliers. To address these issues, we propose tensor-variate latent nonparametric Bayesian models for multiway data analysis. We name these models InfTucker. These new models essentially conduct Tucker decomposition in an infinite feature space. Unlike classical tensor decomposition models, our new approaches handle both continuous and binary data in a probabilistic framework. Unlike previous Bayesian models on matrices and tensors, our models are based on latent Gaussian or t processes with nonlinear covariance functions. Moreover, on network data, our models reduce to nonparametric stochastic blockmodels and can be used to discover latent groups and predict missing interactions. To learn the models efficiently from data, we develop a variational inference technique and explore properties of the Kronecker product for computational efficiency. Compared with a classical variational implementation, this technique reduces both time and space complexities by several orders of magnitude. On real multiway and network data, our new models achieved significantly higher prediction accuracy than state-of-art tensor decomposition methods and blockmodels.
  • Keywords
    Bayes methods; Computational modeling; Data models; Gaussian processes; Matrix decomposition; Noise; Tensile stress; Algorithms for data and knowledge management; Gaussian process; Machine learning; Multiway analysis; network modeling; nonparametric Bayes; random graphs and exchangeable arrays; stochastic blockmodel; tensor/matrix factorization;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.201
  • Filename
    6629993