• DocumentCode
    64450
  • Title

    Leveraging Social Networks for Effective Spam Filtering

  • Author

    Haiying Shen ; Ze Li

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Clemson Univ., Clemson, SC, USA
  • Volume
    63
  • Issue
    11
  • fYear
    2014
  • fDate
    Nov. 2014
  • Firstpage
    2743
  • Lastpage
    2759
  • Abstract
    The explosive growth of unsolicited e-mails has prompted the development of numerous spam filter techniques. Bayesian spam filters are superior to static keyword-based spam filters in that they can continuously evolve to tackle new spam by learning keywords in new spam emails. However, Bayesian spam filters are easily poisoned by clever spammers who avoid spam keywords and add many innocuous words in their emails. Also, Bayesian spam filters need a significant amount of time to adapt to a new spam based on user feedback. Moreover, few current spam filters exploit social networks to assist in spam detection. In order to develop an accurate and user-friendly spam filter, we propose a SOcial network Aided Personalized and effective spam filter (SOAP) in this paper. In SOAP, each node connects to its social friends; i.e., nodes form a distributed overlay by directly using social network links as overlay links. Each node uses SOAP to collect information and check spam autonomously in a distributed manner. Unlike previous spam filters that focus on parsing keywords (e.g., Bayesian filters) or building blacklists, SOAP exploits the social relationships among email correspondents and their (dis)interests to detect spam adaptively and automatically. In each node, SOAP integrates four components into the basic Bayesian filter: social closeness-based spam filtering, social interest-based spam filtering, adaptive trust management, and friend notification. We have evaluated the performance of SOAP using simulation based on trace data from Facebook. We also have implemented a SOAP prototype for real-world experiments. Experimental results show that SOAP can greatly improve the performance of Bayesian spam filters in terms of accuracy, attack-resilience, and efficiency of spam detection. The performance of the Bayesian spam filter is SOAP´s lower bound.
  • Keywords
    Bayes methods; information filtering; performance evaluation; social networking (online); trusted computing; unsolicited e-mail; Bayesian filter; Bayesian spam filters; Facebook; SOAP prototype; adaptive trust management; attack-resilience; clever spammers; distributed overlay; email correspondents; friend notification; overlay links; parsing keywords; performance evaluation; social closeness-based spam filtering; social friends; social interest-based spam filtering; social network aided personalization; social network links; social networks; spam detection; spam emails; spam filter techniques; spam keywords; static keyword-based spam filters; trace data; unsolicited e-mails; user feedback; user-friendly spam filter; Bayes methods; Simple object access protocol; Social network services; Unsolicited electronic mail; Bayesian spam filters; Distributed overlays; social networks; spam filtering;
  • fLanguage
    English
  • Journal_Title
    Computers, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9340
  • Type

    jour

  • DOI
    10.1109/TC.2013.152
  • Filename
    6572788