Title :
The Joint Inference of Topic Diffusion and Evolution in Social Communities
Author :
Lin, Cindy Xide ; Mei, Qiaozhu ; Han, Jiawei ; Jiang, Yunliang ; Danilevsky, Marina
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Abstract :
The prevalence of Web 2.0 techniques has led to the boom of various online communities, where topics spread ubiquitously among user-generated documents. Working together with this diffusion process is the evolution of topic content, where novel contents are introduced by documents which adopt the topic. Unlike explicit user behavior (e.g., buying a DVD), both the diffusion paths and the evolutionary process of a topic are implicit, making their discovery challenging. In this paper, we track the evolution of an arbitrary topic and reveal the latent diffusion paths of that topic in a social community. A novel and principled probabilistic model is proposed which casts our task as an joint inference problem, which considers textual documents, social influences, and topic evolution in a unified way. Specifically, a mixture model is introduced to model the generation of text according to the diffusion and the evolution of the topic, while the whole diffusion process is regularized with user-level social influences through a Gaussian Markov Random Field. Experiments on both synthetic data and real world data show that the discovery of topic diffusion and evolution benefits from this joint inference, and the probabilistic model we propose performs significantly better than existing methods.
Keywords :
Gaussian processes; Internet; Markov processes; inference mechanisms; probability; random processes; social networking (online); text analysis; Gaussian Markov random field; Web 2.0 techniques; joint inference problem; latent diffusion paths; mixture model; online communities; probabilistic model; real world data; social community; social influence; synthetic data; text generation; textual document; topic content; topic diffusion process; topic evolution; user generated document; user level social influence; Communities; Computational modeling; Diffusion processes; Joints; Social network services; Tides; Vectors;
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
Print_ISBN :
978-1-4577-2075-8
DOI :
10.1109/ICDM.2011.144