• DocumentCode
    3601380
  • Title

    IncreSTS: Towards Real-Time Incremental Short Text Summarization on Comment Streams from Social Network Services

  • Author

    Cheng-Ying Liu ; Ming-Syan Chen ; Chi-Yao Tseng

  • Author_Institution
    Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • Volume
    27
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2986
  • Lastpage
    3000
  • Abstract
    This paper focuses on the problem of short text summarization on the comment stream of a specific message from social network services (SNS). Due to the high popularity of SNS, the quantity of comments may increase at a high rate right after a social message is published. Motivated by the fact that users may desire to get a brief understanding of a comment stream without reading the whole comment list, we attempt to group comments with similar content together and generate a concise opinion summary for this message. Since distinct users will request the summary at any moment, existing clustering methods cannot be directly applied and cannot meet the real-time need of this application. In this paper, we model a novel incremental clustering problem for comment stream summarization on SNS. Moreover, we propose IncreSTS algorithm that can incrementally update clustering results with latest incoming comments in real time. Furthermore, we design an at-a-glance visualization interface to help users easily and rapidly get an overview summary. From extensive experimental results and a real case demonstration, we verify that IncreSTS possesses the advantages of high efficiency, high scalability, and better handling outliers, which justifies the practicability of IncreSTS on the target problem.
  • Keywords
    data visualisation; pattern clustering; social networking (online); text analysis; user interfaces; IncreSTS algorithm; SNS; at-a-glance visualization interface; clustering result update; comment stream; incremental short text summarization; opinion summary generation; social network service; Blogs; Clustering algorithms; Facebook; Real-time systems; Twitter; Vectors; Real-time short text summarization; comment streams; incremental clustering; real-time short text summarization; social network services;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2015.2405553
  • Filename
    7045508