Title :
F-score-like measure: A new measure for spam filtering
Author :
Han, Yong ; Qi, Hao-liang ; Yang, Mu-yun
Author_Institution :
Sch. of Comput. Sci. & Technol., Heilongjiang Inst. of Technol., Harbin, China
Abstract :
Logistic average misclassification percentage (lam%) and I-AVC(area under the ROC curve) are two important and wildly adopted measures. This paper demonstrates that a spam filter can achieve a perfect 0.00% in lam%, the minimal value in theory, by simply setting a biased threshold during the classifier modeling. At the same time, I-AVC is left untouched; and the overall classification performance reaches only a low accuracy. This means that lam% and I-AVC as main measures for spam filtering are not suitable. To solve the problem of measuring spam filtering, F-score-like measure based on ham and spam misclassification is proposed to be a single measure for spam filtering evaluation.
Keywords :
classification; information filtering; sensitivity analysis; unsolicited e-mail; F-score-like measure; I-AVC; area under the ROC curve; biased threshold; classifier modeling; ham misclassification; logistic average misclassification percentage; spam filtering; spam misclassification; Abstracts; Accuracy; Computational modeling; Filtering; Unsolicited electronic mail; Evaluation measure; F-score-like measure; I-AVC; Logistic average misclassification percentage; Spam filter;
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
Print_ISBN :
978-1-4673-1484-8
DOI :
10.1109/ICMLC.2012.6359691