DocumentCode
162673
Title
A strategy for automatic moderation of a large data set of users comments
Author
Rodrigues Saude, Marcos ; de Medeiros Soares, Marcelo ; Gomes Basoni, Henrique ; Ciarelli, Patrick Marques ; Oliveira, Eunice
Author_Institution
Programa de Pos-Grad. em Inf. (PPGI), Univ. Fed. do Espirito Santo (UFES), Vitoria, Brazil
fYear
2014
fDate
15-19 Sept. 2014
Firstpage
1
Lastpage
7
Abstract
The increase use of social media and Web 2.0 are daily drawing more people to participate and express their point of views about a variety of subjects. However, there are a huge number of comments which are offensives and sometimes non-politically corrects and so must be hindered from coming up online. This is pushing the services providers to be more careful with the contents they publish to avoid judicial claims. This work proposes the use of automatic classification techniques to identify and only allow to go online harmless comments. We applied various techniques regarding with data processing, such as weighting of terms and the dimensionality reduction. All these techniques have been studied in order to model algorithms to be able to mimic well the human decisions regarding to the comments. The results indicate that we are able to mimic experts decision on 96.78% in the data set used.
Keywords
pattern classification; social networking (online); Web 2.0; automatic classification techniques; data processing; data set moderation; dimensionality reduction; social media; term weighting; users comments; Equations; Genetic algorithms; Mathematical model; Measurement; Sociology; Statistics; Vectors; automatic moderation; dimensionality reduction; feature selection; genetic algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing Conference (CLEI), 2014 XL Latin American
Conference_Location
Montevideo
Type
conf
DOI
10.1109/CLEI.2014.6965181
Filename
6965181
Link To Document