• DocumentCode
    2850684
  • Title

    Spam filtering using a Markov random field model with variable weighting schemas

  • Author

    Chhabra, Shalendra ; Yerazunis, William S. ; Siefkes, Christian

  • Author_Institution
    UC, Riverside, CA, USA
  • fYear
    2004
  • fDate
    1-4 Nov. 2004
  • Firstpage
    347
  • Lastpage
    350
  • Abstract
    In this paper we present a Markov random field model based approach to filter spam. Our approach examines the importance of the neighborhood relationship (MRF cliques) among words in an email message for the purpose of spam classification. We propose and test several different theoretical bases for weighting schemes among corresponding neighborhood windows. Our results demonstrate that unexpected side effects depending on the neighborhood window size may have larger accuracy impact than the neighborhood relationship effects of the Markov random field.
  • Keywords
    Markov processes; information filtering; pattern classification; unsolicited e-mail; Markov random field; email; spam classification; spam filtering; variable weighting schemas; Bayesian methods; Computer hacking; Filtering; Filters; Markov random fields; Polynomials; Random variables; Testing; Text categorization; Unsolicited electronic mail;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
  • Print_ISBN
    0-7695-2142-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2004.10031
  • Filename
    1410307