DocumentCode
1942004
Title
Prevalence and mitigation of forum spamming
Author
Shin, Youngsang ; Gupta, Minaxi ; Myers, Steven
Author_Institution
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
fYear
2011
fDate
10-15 April 2011
Firstpage
2309
Lastpage
2317
Abstract
Forums on the Web are increasingly spammed by miscreants in order to attract visitors to their (often malicious) websites. In this paper, we study the prevalence of forum spamming and find that Internet users are at a high risk of encountering forums with spam links posted on them. To mitigate the problem, we examine the characteristics of 286 days of forum spam posted at a research blog and develop light-weight features based on spammers´ IP, commenting activity and the anatomy of their posts. We find that an SVM classifier trained on these features can achieve a 99.81% precision and 92.82% recall in identifying forum spam.
Keywords
Internet; support vector machines; unsolicited e-mail; IP; Internet; SVM classifier; Web; forum spamming; research blog; Blogs; IP networks; Information services; Internet; Search engines; Software; Unsolicited electronic mail;
fLanguage
English
Publisher
ieee
Conference_Titel
INFOCOM, 2011 Proceedings IEEE
Conference_Location
Shanghai
ISSN
0743-166X
Print_ISBN
978-1-4244-9919-9
Type
conf
DOI
10.1109/INFCOM.2011.5935048
Filename
5935048
Link To Document