DocumentCode :
3694799
Title :
A genomics-based profanity-safe Web forum
Author :
Christian Mogollón Pinzón;Sergio Rojas-Galeano
Author_Institution :
School of Engineering, Universidad Distrital, Bogotá
fYear :
2015
Firstpage :
425
Lastpage :
430
Abstract :
User-generated text is the primary source of interaction in virtual communities on Web2.0 applications such as forums, blogs or social networks. Unfortunately some users abuse this freedom of speech liberty to disseminate non-authorised profanity content (foul language, insults, advertisement, boosting or denigration of a name or a trademark). Naïve filters based on literal comparisons against black-lists of forbidden terms, fail to detect variations obtained by character transliteration or masking (e.g. writing piss as P!55 or p.i.s.s). Recent approaches to this problem inspired in sequence alignment methods from comparative genomics in bioinformatics, have shown promise in preventing overlooking such variants. Building upon those results we have developed an experimental Web forum allowing users to generate text that is screened against transliterated profanity. In this paper we introduce the software (ForumForte) and describe briefly the technique and engineering behind it. We anticipate this kind of tools might prove beneficial for content moderation in mainstream applications such as newspaper forums and micro-blogging social networking sites. Our software is open-source under the New BSD License and is available at: http://tinyurl.com/ForumForte.
Keywords :
"Software","Genomics","Bioinformatics","Blogs","Vegetation","Servers","Organisms"
Publisher :
ieee
Conference_Titel :
Computing Colombian Conference (10CCC), 2015 10th
Type :
conf
DOI :
10.1109/ColumbianCC.2015.7333455
Filename :
7333455
Link To Document :
بازگشت