Title of article :
Automatically computed document dependent weighting factor facility for Naïve Bayes classification
Author/Authors :
Lee، نويسنده , , Lam Hong and Isa، نويسنده , , Dino، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2010
Pages :
8
From page :
8471
To page :
8478
Abstract :
The Naïve Bayes classification approach has been widely implemented in real-world applications due to its simplicity and low cost training and classifying algorithm. As a trade-off to its simplicity, the Naïve Bayes technique has thus been reported to be one of the poorest-performing classification methods around. We have explored and investigated the Naïve Bayes classification approach and found that one of the reasons that causes the low classification accuracy is the mis-classification of documents into several “popular” categories due to the improper organization of the training dataset where the distribution of training documents among categories is highly skewed. In this work, we propose a solution to the problem addressed above, which is the addition of the Automatically Computed Document Dependent (ACDD) weighting factor facility to the Naïve Bayes classifier. The ACDD weighting factors are computed for the purpose of enhancing the classification performance by adjusting the probability values based on the density of classified documents in each available category to minimize the mis-classification rate.
Keywords :
Text document classification , ACDD weighting factor facility , naïve Bayes
Journal title :
Expert Systems with Applications
Serial Year :
2010
Journal title :
Expert Systems with Applications
Record number :
2348572
Link To Document :
بازگشت