DocumentCode :
1830715
Title :
Predicting susceptibility to social bots on Twitter
Author :
Wald, Randall ; Khoshgoftaar, Taghi M. ; Napolitano, Antonio ; Sumner, Chris
Author_Institution :
Florida Atlantic Univ., Boca Raton, FL, USA
fYear :
2013
fDate :
14-16 Aug. 2013
Firstpage :
6
Lastpage :
13
Abstract :
The popularity of the Twitter social networking site has made it a target for social bots, which use increasingly-complex algorithms to engage users and pretend to be humans. While much research has studied how to identify such bots in the process of spam detection, little research has looked at the other side of the question - detecting users likely to be fooled by bots. In this paper, we examine a dataset consisting of 610 users who were messaged by Twitter bots, and determine which features describing these users were most helpful in predicting whether or not they would interact with the bots (through replies or following the bot). We then use six classifiers to build models for predicting whether a given user will interact with the bot, both using the selected features and using all features. We find that a users´ Klout score, friends count, and followers count are most predictive of whether a user will interact with a bot, and that the Random Forest algorithm produces the best classifier, when used in conjunction with one of the better feature ranking algorithms (although poor feature ranking can actually make performance worse than no feature ranking). Overall, these results show promise for helping understand which users are most vulnerable to social bots.
Keywords :
social networking (online); software agents; unsolicited e-mail; Klout score; Twitter bots; Twitter social networking site; feature ranking algorithms; increasingly-complex algorithms; random forest algorithm; social bots; spam detection; susceptibility prediction; Feature extraction; Measurement; Pragmatics; Predictive models; Support vector machines; Twitter; Twitter; feature selection; social bots;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2013 IEEE 14th International Conference on
Conference_Location :
San Francisco, CA
Type :
conf
DOI :
10.1109/IRI.2013.6642447
Filename :
6642447
Link To Document :
بازگشت