• DocumentCode
    555757
  • Title

    ETree: Effective and Efficient Event Modeling for Real-Time Online Social Media Networks

  • Author

    Gu, Hansu ; Xie, Xing ; Lv, Qin ; Ruan, Yaoping ; Shang, Li

  • Author_Institution
    Univ. of Colorado at Boulder, Boulder, CO, USA
  • Volume
    1
  • fYear
    2011
  • fDate
    22-27 Aug. 2011
  • Firstpage
    300
  • Lastpage
    307
  • Abstract
    Outline social media networks (OSMNs) such as Twitter provide great opportunities for public engagement and event information dissemination. Event-related discussions occur in real time and at the worldwide scale. However, these discussions are in the form of short, unstructured messages and dynamically woven into daily chats and status updates. Compared with traditional news articles, the rich and diverse user-generated content raises unique new challenges for tracking and analyzing events. Effective and efficient event modeling is thus essential for real-time information-intensive OSMNs. In this work, we propose ETree, an effective and efficient event modeling solution for social media network sites. Targeting the unique challenges of this problem, ETree consists of three key components: (1) an n-gram based content analysis technique for identifying core information blocks from a large number of short messages, (2) an incremental and hierarchical modeling technique for identifying and constructing event theme structures at different granularities, and (3) an enhanced temporal analysis technique for identifying inherent causalities between information blocks. Detailed evaluation using 3.5 million tweets over a 5-month period demonstrates that ETree can efficiently generate high-quality event structures and identify inherent causal relationships with high accuracy.
  • Keywords
    content management; information dissemination; social networking (online); tree data structures; ETree; Twitter; core information block identification; event information dissemination; event modeling; event structures; event-related discussions; hierarchical modeling technique; incremental modeling technique; information-intensive OSMN; n-gram based content analysis technique; public engagement; real-time online social media networks; social media network sites; temporal analysis technique; Earthquakes; Electronic mail; Media; Real time systems; Semantics; Time frequency analysis; Twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    978-1-4577-1373-6
  • Electronic_ISBN
    978-0-7695-4513-4
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2011.126
  • Filename
    6036774