• DocumentCode
    658603
  • Title

    Real Time Micro-blog Summarization Based on Hadoop/HBase

  • Author

    Sanghoon Lee ; Shakya, Sunny ; Sunderraman, R. ; Belkasim, Saeid

  • Author_Institution
    Dept. of Comput. Sci., Georgia State Univ., Atlanta, GA, USA
  • Volume
    3
  • fYear
    2013
  • fDate
    17-20 Nov. 2013
  • Firstpage
    46
  • Lastpage
    49
  • Abstract
    Micro-blog is a medium of communication that allows users to communicate with each other via short contents. Using the micro-blog as a way of spreading information more broadly has gained much interest as a new social medium where the contents can be delivered in real-time. However, the users should take the trouble to read manually through the posts for understanding a specific topic since the posts have been sorted by time, not relevancy. In this paper, we present a real time application that summarizes the posts by relevancy, considering the time that the posts are written. We set Hadoop environment with HBase since the application needs to be scalable and also, fault-tolerant. Summaries that the application produces are evaluated by ROUGE metric which is a well-known summary evaluation method. The evaluation result indicates that the summaries produced by the application show better results comparing to summaries generated by a traditional summarization method.
  • Keywords
    Web sites; distributed databases; Hadoop-HBase; ROUGE metric; post summarization; real time microblog summarization; summary evaluation method; Cloud computing; Conferences; Fuzzy sets; Measurement; Real-time systems; Speech; Twitter; HBase; Mocro-blog; Summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2013 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4799-2902-3
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2013.148
  • Filename
    6690692