Title :
Multi-objective spam filtering using an evolutionary algorithm
Author :
Dudley, James ; Barone, Luigi ; While, Lyndon
Author_Institution :
Sch. of Comput. Sci. & Software Eng., Univ. of Western Australia, Perth, WA
Abstract :
SpamAssassin is a widely-used open source heuristic-based spam filter that applies a large number of weighted tests to a message, sums the results of the tests, and labels the message as spam if the sum exceeds a user-defined threshold. Due to the large number of tests and the interactions between them, defining good weights for SpamAssassin is difficult: moreover, users with different needs may desire different sets of weights to be used. We have built a multi-objective evolutionary algorithm MOSF that evolves weights for the tests in SpamAssassin according to two independent objectives: minimising the number of false positives (legitimate messages mislabeled as spam), and minimising the number of false negatives (spam messages mislabeled as legitimate). We show that MOSF returns a set of solutions offering a range of setups for SpamAssassin satisfying different userspsila needs, and also that MOSF can derive solutions which beat the existing SpamAssassin weights in both objectives simultaneously. Applying these ideas could substantially increase the usefulness of SpamAssassin and similar systems.
Keywords :
evolutionary computation; security of data; unsolicited e-mail; MOSF; SpamAssassin; multi-objective evolutionary algorithm; multi-objective spam filtering; spam messages; user-defined threshold; widely-used open source heuristic; Costs; Evolutionary computation; Filtering; Filters; Internet; Productivity; Switches; Testing; Unsolicited electronic mail; Wikipedia;
Conference_Titel :
Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1822-0
Electronic_ISBN :
978-1-4244-1823-7
DOI :
10.1109/CEC.2008.4630786