DocumentCode
3761874
Title
A study on topics identification on Twitter using clustering algorithms
Author
Marjori N. M. Klinczak;Celso A. A. Kaestner
Author_Institution
Mosaic Web and Graduate Program in Applied Computer Science, Federal University of Technology - Paran? Avenida Sete de Setembro 3165 80230-901 Curitiba - Paran? - Brazil
fYear
2015
Firstpage
1
Lastpage
6
Abstract
The identification of topics in Social Networks has become an important research task when dealing with event detection, particularly when global communities are affected. Text processing techniques and machine learning algorithms have been extensively used to solve this problem. In this paper we compare three clustering algorithms - k-means, k-medoids and NMF (Non-negative Matrix Factorization) - in order to detect topics related to textual messages obtained from Twitter. The algorithms were applied to a database composed by tweets, having as initial context hashtags that are related to the recent scandal of corruption involving FIFA (International Federation of Football Association). Obtained results suggest that the NMF presents better results, since it provides providing clusters that are easier to interpret.
Keywords
"Clustering algorithms","Twitter","Algorithm design and analysis","Principal component analysis","Text processing","Context"
Publisher
ieee
Conference_Titel
Computational Intelligence (LA-CCI), 2015 Latin America Congress on
Type
conf
DOI
10.1109/LA-CCI.2015.7435965
Filename
7435965
Link To Document