• DocumentCode
    441662
  • Title

    Dynamic rules´ score adjustment in spam filter using users´ feedback

  • Author

    Wang, Xin ; Duan, Hai-Xin ; Anh, Tran Q. ; Li, Xue-Nong

  • Author_Institution
    Network Res. Center, Tsinghua Univ., Beijing, China
  • Volume
    2
  • fYear
    2005
  • fDate
    18-21 Aug. 2005
  • Firstpage
    665
  • Abstract
    An interactive spam filter is proposed in this paper to reduce misclassification of spam. A set of weighted rules are applied to an email to decide if it´s a spam. If a rule is triggered, the weight of it would be added. When the sum score of the total rules triggered is bigger than a threshold, the email would be classified as spam. The scores of rules are achieved by improved genetic algorithm. The improved genetic algorithm is applied to a corpus contains both legitimate spam and emails to get the initial score of each rule. During the filter process, end users´ feedback is collected and used to adjust the scores of rules. Because retraining the whole data set is time consuming, the incremental learning algorithm is used to adjust rules´ scores. The method is implemented and tested in one of the email servers in Network Research Center of Tsinghua University. The experimental result shows the false positive rate is reduced significantly by adding the user interaction to the spam filter.
  • Keywords
    genetic algorithms; information filters; interactive systems; learning (artificial intelligence); unsolicited e-mail; dynamic rule score adjustment; email; genetic algorithm; incremental learning algorithm; interactive spam filter; spam misclassification; user feedback; Electronic mail; Feedback; Genetic algorithms; Information filtering; Information filters; Intelligent networks; Internet; Network servers; Testing; Unsolicited electronic mail; Spam filtering; improved genetic algorithm; incremental learning; interactive;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
  • Conference_Location
    Guangzhou, China
  • Print_ISBN
    0-7803-9091-1
  • Type

    conf

  • DOI
    10.1109/ICMLC.2005.1527028
  • Filename
    1527028