DocumentCode :
259694
Title :
TSD: Detecting Sybil Accounts in Twitter
Author :
Alsaleh, Mansour ; Alarifi, Abdulrahman ; Al-Salman, Abdul Malik ; Alfayez, Mohammed ; Almuhaysin, Abdulmajeed
Author_Institution :
Comput. Res. Inst., King Abdulaziz City for Sci. & Technol., Riyadh, Saudi Arabia
fYear :
2014
fDate :
3-6 Dec. 2014
Firstpage :
463
Lastpage :
469
Abstract :
Fake identities and user accounts (also called "Sybils") in online communities represent today a treasure for adversaries to spread fake product reviews, malware and spam on social networks, and Astroturf political campaigns. State-of-the-art in the defense mechanisms includes Automated Turing Tests (ATTs such as CAPTCHAs) and graph-based Sybil detectors. Sybil detectors in social networks leverage the assumption that Sybils will find it hard to befriend real users which leads to Sybils being connected to each other forming strongly connected sub graphs that can be detected using graph theory. However, the large majority of Sybils are in fact successful in integrating themselves into real user communities (such as the case in Twitter and Facebook). In this paper, we first study and compare the current detection mechanisms of Sybil accounts. We also explore various types of Twitter Sybil accounts detection features with the objective of building an effective and practical classifier. In order to build and evaluate our classifier, we collect and manually label a dataset of twitter accounts, including human users, bots, and hybrid (i.e., Tweets are posted by both human and bots). We believe this Twitter Sybils corpus will help researchers in conducting sound measurement studies. We also develop a browser plug-in (that we call Twitter Sybils Detector or TSD for short) that utilizes our classifier and warns the user about possible Sybil accounts before accessing them, upon clicking on a Twitter account.
Keywords :
graph theory; invasive software; social networking (online); unsolicited e-mail; ATT; Astroturf political campaigns; Sybil accounts detection; TSD; Twitter; automated Turing tests; fake identities; fake product reviews; graph theory; malware; online communities; social networks; spam; user accounts; Browsers; Decision trees; Feature extraction; Servers; Support vector machines; Twitter; Content spam; Fake user accounts; Social networks; Spamdexing; Sybil account; Twitter; Web spam;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2014 13th International Conference on
Conference_Location :
Detroit, MI
Type :
conf
DOI :
10.1109/ICMLA.2014.81
Filename :
7033160
Link To Document :
بازگشت