DocumentCode :
2850684
Title :
Spam filtering using a Markov random field model with variable weighting schemas
Author :
Chhabra, Shalendra ; Yerazunis, William S. ; Siefkes, Christian
Author_Institution :
UC, Riverside, CA, USA
fYear :
2004
fDate :
1-4 Nov. 2004
Firstpage :
347
Lastpage :
350
Abstract :
In this paper we present a Markov random field model based approach to filter spam. Our approach examines the importance of the neighborhood relationship (MRF cliques) among words in an email message for the purpose of spam classification. We propose and test several different theoretical bases for weighting schemes among corresponding neighborhood windows. Our results demonstrate that unexpected side effects depending on the neighborhood window size may have larger accuracy impact than the neighborhood relationship effects of the Markov random field.
Keywords :
Markov processes; information filtering; pattern classification; unsolicited e-mail; Markov random field; email; spam classification; spam filtering; variable weighting schemas; Bayesian methods; Computer hacking; Filtering; Filters; Markov random fields; Polynomials; Random variables; Testing; Text categorization; Unsolicited electronic mail;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
Type :
conf
DOI :
10.1109/ICDM.2004.10031
Filename :
1410307
Link To Document :
بازگشت