DocumentCode
555757
Title
ETree: Effective and Efficient Event Modeling for Real-Time Online Social Media Networks
Author
Gu, Hansu ; Xie, Xing ; Lv, Qin ; Ruan, Yaoping ; Shang, Li
Author_Institution
Univ. of Colorado at Boulder, Boulder, CO, USA
Volume
1
fYear
2011
fDate
22-27 Aug. 2011
Firstpage
300
Lastpage
307
Abstract
Outline social media networks (OSMNs) such as Twitter provide great opportunities for public engagement and event information dissemination. Event-related discussions occur in real time and at the worldwide scale. However, these discussions are in the form of short, unstructured messages and dynamically woven into daily chats and status updates. Compared with traditional news articles, the rich and diverse user-generated content raises unique new challenges for tracking and analyzing events. Effective and efficient event modeling is thus essential for real-time information-intensive OSMNs. In this work, we propose ETree, an effective and efficient event modeling solution for social media network sites. Targeting the unique challenges of this problem, ETree consists of three key components: (1) an n-gram based content analysis technique for identifying core information blocks from a large number of short messages, (2) an incremental and hierarchical modeling technique for identifying and constructing event theme structures at different granularities, and (3) an enhanced temporal analysis technique for identifying inherent causalities between information blocks. Detailed evaluation using 3.5 million tweets over a 5-month period demonstrates that ETree can efficiently generate high-quality event structures and identify inherent causal relationships with high accuracy.
Keywords
content management; information dissemination; social networking (online); tree data structures; ETree; Twitter; core information block identification; event information dissemination; event modeling; event structures; event-related discussions; hierarchical modeling technique; incremental modeling technique; information-intensive OSMN; n-gram based content analysis technique; public engagement; real-time online social media networks; social media network sites; temporal analysis technique; Earthquakes; Electronic mail; Media; Real time systems; Semantics; Time frequency analysis; Twitter;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location
Lyon
Print_ISBN
978-1-4577-1373-6
Electronic_ISBN
978-0-7695-4513-4
Type
conf
DOI
10.1109/WI-IAT.2011.126
Filename
6036774
Link To Document