• Title of article

    Evaluation of k-Nearest Neighbor classifier performance for direct marketing

  • Author/Authors

    Govindarajan، نويسنده , , M. and Chandrasekaran، نويسنده , , RM.، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2010
  • Pages
    6
  • From page
    253
  • To page
    258
  • Abstract
    Text data mining is a process of exploratory data analysis. Classification maps data into predefined groups or classes. It is often referred to as supervised learning because the classes are determined before examining the data. This paper describes the proposed k-Nearest Neighbor classifier that performs comparative cross-validation for the existing k-Nearest Neighbor classifier. The feasibility and the benefits of the proposed approach are demonstrated by means of data mining problem: direct marketing. Direct marketing has become an important application field of data mining. Comparative cross-validation involves estimation of accuracy by either stratified k-fold cross-validation or equivalent repeated random subsampling. While the proposed method may have a high bias; its performance (accuracy estimation in our case) may be poor due to a high variance. Thus the accuracy with the proposed k-Nearest Neighbor classifier was less than that with the existing k-Nearest Neighbor classifier, and the smaller the improvement in runtime the larger the improvement in precision and recall. In our proposed method we have determined the classification accuracy and prediction accuracy where the prediction accuracy is comparatively high.
  • Keywords
    DATA MINING , cross-validation , K-nearest neighbor , Runtime , Accuracy
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2010
  • Journal title
    Expert Systems with Applications
  • Record number

    2347088