DocumentCode
660771
Title
Multi-tweet Summarization of Real-Time Events
Author
Khan, Muhammad Asif Hossain ; Bollegala, Danushka ; Guangwen Liu ; Sezaki, K.
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2013
fDate
8-14 Sept. 2013
Firstpage
128
Lastpage
133
Abstract
Popular real-time public events often cause upsurge of traffic in Twitter while the event is taking place. These posts range from real-time update of the event´s occurrences highlights of important moments thus far, personal comments and so on. A large user group has evolved who seeks these live updates to get a brief summary of the important moments of the event so far. However, major social search engines including Twitter still present the tweets satisfying the Boolean query in reverse chronological order, resulting in thousands of low quality matches agglomerated in a prosaic manner. To get an overview of the happenings of the event, a user is forced to read scores of uninformative tweets causing frustration. In this paper, we propose a method for multi-tweet summarization of an event. It allows the search users to quickly get an overview about the important moments of the event. We have proposed a graph-based retrieval algorithm that identifies tweets with popular discussion points among the set of tweets returned by Twitter search engine in response to a query comprising the event related keywords. To ensure maximum coverage of topical diversity, we perform topical clustering of the tweets before applying the retrieval algorithm. Evaluation performed by summarizing the important moments of a real-world event revealed that the proposed method could summarize the proceeding of different segments of the event with up to 81.6% precision and up to 80% recall.
Keywords
Boolean algebra; graph theory; information retrieval; search engines; social networking (online); text analysis; Boolean query; Twitter search engine; graph-based retrieval algorithm; multitweet summarization; real-time public event; social search engine; topical clustering; topical diversity; Clustering algorithms; Hidden Markov models; Linear programming; Real-time systems; Search engines; Twitter; Vectors; Social network analysis; Text mining; Tweet summarization; Twitter search;
fLanguage
English
Publisher
ieee
Conference_Titel
Social Computing (SocialCom), 2013 International Conference on
Conference_Location
Alexandria, VA
Type
conf
DOI
10.1109/SocialCom.2013.26
Filename
6693323
Link To Document