• DocumentCode
    2982189
  • Title

    Detecting Spam and Promoting Campaigns in the Twitter Social Network

  • Author

    Xianchao Zhang ; Shaoping Zhu ; Wenxin Liang

  • Author_Institution
    Sch. of Software, Dalian Univ. of Technol., Dalian, China
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    1194
  • Lastpage
    1199
  • Abstract
    The Twitter social network has become a target platform for both promoters and stammers to disseminate their target messages. There are a large number of campaigns containing coordinated spam or promoting accounts in Twitter, which are more harmful than the traditional methods, such as email spamming. Since traditional solutions mainly check individual accounts or messages, it is an urgent task to detect spam and promoting campaigns in Twitter. In this paper, we propose a scalable framework to detect both spam and promoting campaigns. Our framework consists of three steps: firstly linking accounts who post URLs for similar purposes, secondly extracting candidate campaigns which may exist for spam or promoting purpose and finally distinguishing their intents. One salient aspect of the framework is introducing a URL-driven estimation method to measure the similarity between accounts´ purposes of posting URLs, the other one is proposing multiple features to distinguish the candidate campaigns based on a machine learning method. Over a large-scale dataset from Twitter, we can extract the actual campaigns with high precision and recall and distinguish the majority of the candidate campaigns correctly.
  • Keywords
    learning (artificial intelligence); social networking (online); unsolicited e-mail; Twitter social network; URL-driven estimation method; candidate campaign extraction; candidate campaigns; coordinated spam; email spamming; large-scale dataset; machine learning method; message dissemination; salient aspect; scalable campaign promotion framework; scalable spam detection framework; Educational institutions; Estimation; Feature extraction; Twitter; Unsolicited electronic mail; campaign detect; similarity measure; social spam;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.28
  • Filename
    6413730