Title :
Discriminative Relational Topic Models
Author :
Ning Chen ; Jun Zhu ; Fei Xia ; Bo Zhang
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
Abstract :
Relational topic models (RTMs) provide a probabilistic generative process to describe both the link structure and document contents for document networks, and they have shown promise on predicting network structures and discovering latent topic representations. However, existing RTMs have limitations in both the restricted model expressiveness and incapability of dealing with imbalanced network data. To expand the scope and improve the inference accuracy of RTMs, this paper presents three extensions: 1) unlike the common link likelihood with a diagonal weight matrix that allows the-same-topic interactions only, we generalize it to use a full weight matrix that captures all pairwise topic interactions and is applicable to asymmetric networks; 2) instead of doing standard Bayesian inference, we perform regularized Bayesian inference (RegBayes) with a regularization parameter to deal with the imbalanced link structure issue in real networks and improve the discriminative ability of learned latent representations; and 3) instead of doing variational approximation with strict mean-field assumptions, we present collapsed Gibbs sampling algorithms for the generalized relational topic models by exploring data augmentation without making restricting assumptions. Under the generic RegBayes framework, we carefully investigate two popular discriminative loss functions, namely, the logistic log-loss and the max-margin hinge loss. Experimental results on several real network datasets demonstrate the significance of these extensions on improving prediction performance.
Keywords :
Bayes methods; Markov processes; Monte Carlo methods; document handling; inference mechanisms; matrix algebra; RegBayes; asymmetric networks; collapsed Gibbs sampling algorithms; data augmentation; discriminative RTM; discriminative loss functions; discriminative relational topic models; document networks; full weight matrix; imbalanced link structure; imbalanced network data; latent topic representation discovery; logistic log-loss; max-margin hinge loss; network structure prediction; pairwise topic interactions; probabilistic generative process; regularized Bayesian inference; Analytical models; Bayes methods; Data models; Fasteners; Logistics; Predictive models; Training; Statistical network analysis; data augmentation; regularized Bayesian inference; relational topic models; statistical network analysis;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2014.2361129