• DocumentCode
    124225
  • Title

    Time Makes Sense: Event Discovery in Twitter Using Temporal Similarity

  • Author

    Stilo, Giovanni ; Velardi, Paola

  • Author_Institution
    Dipt. di Inf., Sapienza Univ. di Roma, Rome, Italy
  • Volume
    2
  • fYear
    2014
  • fDate
    11-14 Aug. 2014
  • Firstpage
    186
  • Lastpage
    193
  • Abstract
    Temporal text mining (TTM) has recently attracted the attention of scientists as a mean to discover and track in real-time discussions in micro-blogs. However current approaches to temporal mining suffer from efficiency problems when applied to large micro-blog streams, like Twitter, now reaching an average of 500 million tweets per daay. We propose a technique, named SAX (based on an algorithm named Symbolic Aggregate Approximation) to discretize the temporal series of terms into a small set of levels, leading to a string for each terms. We then define a subset of "interesting" strings, i.e. Those representing patterns of collective attention. Sliding temporal windows are used to detect clusters of terms with the same string. We show that SAX is more efficient (by orders of magnitude) than other approaches to temporal mining in literature. In this paper, we experiment SAX on the task of event discovery over one year 1% world while Twitter stream.
  • Keywords
    data mining; social networking (online); text analysis; SAX algorithm; TTM; Twitter; event discovery; microblog streams; sliding temporal windows; symbolic aggregate approximation algorithm; temporal similarity; temporal text mining; Acceleration; Aggregates; Algorithm design and analysis; Clustering algorithms; Complexity theory; Time series analysis; Twitter; Symbolic Aggregate approXimation; Twitter mining; event discovery; temporal text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Warsaw
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2014.97
  • Filename
    6927624