• DocumentCode
    3124652
  • Title

    The Joint Inference of Topic Diffusion and Evolution in Social Communities

  • Author

    Lin, Cindy Xide ; Mei, Qiaozhu ; Han, Jiawei ; Jiang, Yunliang ; Danilevsky, Marina

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    378
  • Lastpage
    387
  • Abstract
    The prevalence of Web 2.0 techniques has led to the boom of various online communities, where topics spread ubiquitously among user-generated documents. Working together with this diffusion process is the evolution of topic content, where novel contents are introduced by documents which adopt the topic. Unlike explicit user behavior (e.g., buying a DVD), both the diffusion paths and the evolutionary process of a topic are implicit, making their discovery challenging. In this paper, we track the evolution of an arbitrary topic and reveal the latent diffusion paths of that topic in a social community. A novel and principled probabilistic model is proposed which casts our task as an joint inference problem, which considers textual documents, social influences, and topic evolution in a unified way. Specifically, a mixture model is introduced to model the generation of text according to the diffusion and the evolution of the topic, while the whole diffusion process is regularized with user-level social influences through a Gaussian Markov Random Field. Experiments on both synthetic data and real world data show that the discovery of topic diffusion and evolution benefits from this joint inference, and the probabilistic model we propose performs significantly better than existing methods.
  • Keywords
    Gaussian processes; Internet; Markov processes; inference mechanisms; probability; random processes; social networking (online); text analysis; Gaussian Markov random field; Web 2.0 techniques; joint inference problem; latent diffusion paths; mixture model; online communities; probabilistic model; real world data; social community; social influence; synthetic data; text generation; textual document; topic content; topic diffusion process; topic evolution; user generated document; user level social influence; Communities; Computational modeling; Diffusion processes; Joints; Social network services; Tides; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.144
  • Filename
    6137242