• DocumentCode
    3761874
  • Title

    A study on topics identification on Twitter using clustering algorithms

  • Author

    Marjori N. M. Klinczak;Celso A. A. Kaestner

  • Author_Institution
    Mosaic Web and Graduate Program in Applied Computer Science, Federal University of Technology - Paran? Avenida Sete de Setembro 3165 80230-901 Curitiba - Paran? - Brazil
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    The identification of topics in Social Networks has become an important research task when dealing with event detection, particularly when global communities are affected. Text processing techniques and machine learning algorithms have been extensively used to solve this problem. In this paper we compare three clustering algorithms - k-means, k-medoids and NMF (Non-negative Matrix Factorization) - in order to detect topics related to textual messages obtained from Twitter. The algorithms were applied to a database composed by tweets, having as initial context hashtags that are related to the recent scandal of corruption involving FIFA (International Federation of Football Association). Obtained results suggest that the NMF presents better results, since it provides providing clusters that are easier to interpret.
  • Keywords
    "Clustering algorithms","Twitter","Algorithm design and analysis","Principal component analysis","Text processing","Context"
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence (LA-CCI), 2015 Latin America Congress on
  • Type

    conf

  • DOI
    10.1109/LA-CCI.2015.7435965
  • Filename
    7435965