DocumentCode :
3447365
Title :
Microblog bursty topic detection based on user relationship
Author :
Yanyan Du ; Yanxiang He ; Ye Tian ; Qiang Chen ; Lu Lin
Author_Institution :
Sch. of Comput., Wuhan Univ., Wuhan, China
Volume :
1
fYear :
2011
fDate :
20-22 Aug. 2011
Firstpage :
260
Lastpage :
263
Abstract :
Microblogs, through which people read and send short text messages, are becoming increasingly popular in daily life. With millions of messages posted every day, detecting bursty topics in real time helps people grasp the central information. In this paper we propose a novel bursty topic detection technique based on an improved term-weighting method that takes into account user weight and the number of listeners, replies and collections. We first use a novel aging theory to model the life cycle of a term, then calculate user weight with an improved PageRank algorithm and use it to express term weight, and finally adopt an unsupervised learning algorithm to detect bursty topics. We also provide results and an evaluation against the TF-IDF (term frequency-inverse document frequency) and UF-ITUF (user frequency-inverse thread user frequency) models to demonstrate the validity of the proposed approach.
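The abstract does not give the paper's formulas, so the following is only a rough sketch of the general idea of weighting terms by who posts them. It uses the standard power-iteration PageRank rather than the authors' improved variant, leaves out the aging model entirely, and combines replies and collections with a made-up factor `1 + replies + collections`. The names `pagerank`, `term_weights`, and the input layouts are all hypothetical.

```python
from collections import defaultdict


def pagerank(follows, damping=0.85, iterations=50):
    """Standard power-iteration PageRank (not the paper's improved variant).

    `follows` maps each user to the list of users they listen to, so rank
    flows from a listener to the users being listened to.
    """
    users = set(follows)
    for targets in follows.values():
        users.update(targets)
    n = len(users)
    rank = {u: 1.0 / n for u in users}
    for _ in range(iterations):
        new_rank = {u: (1.0 - damping) / n for u in users}
        for u in users:
            targets = follows.get(u, [])
            if targets:
                share = damping * rank[u] / len(targets)
                for t in targets:
                    new_rank[t] += share
            else:
                # Dangling user (listens to nobody): spread rank evenly.
                share = damping * rank[u] / n
                for t in users:
                    new_rank[t] += share
        rank = new_rank
    return rank


def term_weights(posts, user_rank):
    """Score each term by the rank of its authors and post engagement.

    `posts` is a list of (user, text, replies, collections) tuples.
    The boost formula is an illustrative assumption, not the paper's.
    """
    weights = defaultdict(float)
    for user, text, replies, collections in posts:
        boost = user_rank.get(user, 0.0) * (1 + replies + collections)
        # Count each term once per post.
        for term in set(text.lower().split()):
            weights[term] += boost
    return dict(weights)
```

For example, with `follows = {"a": ["c"], "b": ["c"], "c": []}`, user `c` receives the highest rank, so terms in posts by `c` that also attract replies or collections outweigh terms from less-followed users.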
Keywords :
document handling; information analysis; social networking (online); PageRank algorithm; TF-IDF; UF-ITUF; aging theory; bursty topic detection technique; central information; microblog bursty topic detection; short text messages; term frequency-inverse document frequency; term life cycle; term weight; unsupervised learning algorithm; user frequency-inverse thread user frequency model; user relationship; Adaptation models; Aging; Data mining; Feature extraction; Hidden Markov models; Time frequency analysis; Unsupervised learning; aging theory; bursty keywords; topic detection; user relationship;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Information Technology and Artificial Intelligence Conference (ITAIC), 2011 6th IEEE Joint International
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-8622-9
Type :
conf
DOI :
10.1109/ITAIC.2011.6030199
Filename :
6030199