DocumentCode :
2840167
Title :
Spammer Behavior Analysis and Detection in User Generated Content on Social Networks
Author :
Enhua Tan ; Lei Guo ; Songqing Chen ; Xiaodong Zhang ; Yihong Zhao
Author_Institution :
Ohio State Univ., Columbus, OH, USA
fYear :
2012
fDate :
18-21 June 2012
Firstpage :
305
Lastpage :
314
Abstract :
Spam content is surging with an explosive increase of user generated content (UGC) on the Internet. Spammers often insert popular keywords or simply copy and paste recent articles from the Web with spam links inserted, attempting to disable content-based detection. In order to effectively detect spam in user generated content, we first conduct a comprehensive analysis of spamming activities on a large commercial UGC site in 325 days covering over 6 million posts and nearly 400 thousand users. Our analysis shows that UGC spammers exhibit unique non-textual patterns, such as posting activities, advertised spam link metrics, and spam hosting behaviors. Based on these non-textual features, we show via several classification methods that a high detection rate could be achieved offline. These results further motivate us to develop a runtime scheme, BARS, to detect spam posts based on these spamming patterns. The experimental results demonstrate the effectiveness and robustness of BARS.
Keywords :
Internet; classification; social networking (online); unsolicited e-mail; BARS; Internet; UGC; advertised spam link metrics; classification methods; content-based detection; runtime scheme; social networks; spam content; spam hosting behaviors; spam links; spammer behavior analysis; spammer behavior detection; unique nontextual patterns; user generated content; Bars; Blogs; Feature extraction; Runtime; Software; Unsolicited electronic mail;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Computing Systems (ICDCS), 2012 IEEE 32nd International Conference on
Conference_Location :
Macau
ISSN :
1063-6927
Print_ISBN :
978-1-4577-0295-2
Type :
conf
DOI :
10.1109/ICDCS.2012.40
Filename :
6258003
Link To Document :
بازگشت