• DocumentCode
    1758156
  • Title

    Discriminative Relational Topic Models

  • Author

    Ning Chen ; Jun Zhu ; Fei Xia ; Bo Zhang

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • Volume
    37
  • Issue
    5
  • fYear
    2015
  • fDate
    May 1 2015
  • Firstpage
    973
  • Lastpage
    986
  • Abstract
    Relational topic models (RTMs) provide a probabilistic generative process to describe both the link structure and document contents for document networks, and they have shown promise on predicting network structures and discovering latent topic representations. However, existing RTMs have limitations in both the restricted model expressiveness and incapability of dealing with imbalanced network data. To expand the scope and improve the inference accuracy of RTMs, this paper presents three extensions: 1) unlike the common link likelihood with a diagonal weight matrix that allows the-same-topic interactions only, we generalize it to use a full weight matrix that captures all pairwise topic interactions and is applicable to asymmetric networks; 2) instead of doing standard Bayesian inference, we perform regularized Bayesian inference (RegBayes) with a regularization parameter to deal with the imbalanced link structure issue in real networks and improve the discriminative ability of learned latent representations; and 3) instead of doing variational approximation with strict mean-field assumptions, we present collapsed Gibbs sampling algorithms for the generalized relational topic models by exploring data augmentation without making restricting assumptions. Under the generic RegBayes framework, we carefully investigate two popular discriminative loss functions, namely, the logistic log-loss and the max-margin hinge loss. Experimental results on several real network datasets demonstrate the significance of these extensions on improving prediction performance.
  • Keywords
    Bayes methods; Markov processes; Monte Carlo methods; document handling; inference mechanisms; matrix algebra; RegBayes; asymmetric networks; collapsed Gibbs sampling algorithms; data augmentation; discriminative RTM; discriminative loss functions; discriminative relational topic models; document networks; full weight matrix; imbalanced link structure; imbalanced network data; latent topic representation discovery; logistic log-loss; max-margin hinge loss; network structure prediction; pairwise topic interactions; probabilistic generative process; regularized Bayesian inference; Analytical models; Bayes methods; Data models; Fasteners; Logistics; Predictive models; Training; Statistical network analysis; data augmentation; regularized Bayesian inference; relational topic models; statistical network analysis;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2014.2361129
  • Filename
    6914609