• DocumentCode
    1733493
  • Title

    A feature selection model for binary classification of imbalanced data based on preference for target instances

  • Author

    Tan, Ding-Wen ; Liew, Soung-Yue ; Tan, Teik-Boon ; Yeoh, William

  • Author_Institution
    Univ. Tunku Abdul Rahman, Kampar, Malaysia
  • fYear
    2012
  • Firstpage
    35
  • Lastpage
    42
  • Abstract
    Telemarketers of online job advertising firms face significant challenges understanding the advertising demands of small-sized enterprises. The effective use of data mining approach can offer e-recruitment companies an improved understanding of customers´ patterns and greater insights of purchasing trends. However, prior studies on classifier built by data mining approach provided limited insights into the customer targeting problem of job advertising companies. In this paper we develop a single feature evaluator and propose an approach to select a desired feature subset by setting a threshold. The proposed feature evaluator demonstrates its stability and outstanding performance through empirical experiments in which real-world customer data of an e-recruitment firm are used. Practically, the findings together with the model may help telemarketers to better understand their customers. Theoretically, this paper extends existing research on feature selection for binary classification of imbalanced data.
  • Keywords
    advertising; consumer behaviour; customer profiles; data mining; feature extraction; pattern classification; recruitment; advertising demand; binary classification; customer data; customer pattern; customer targeting problem; data mining; e-recruitment company; feature evaluator; feature selection model; feature subset; imbalanced data; job advertising company; online job advertising firm; preference; purchasing trend; small-sized enterprise; target instances; telemarketers; Artificial neural networks; Classification algorithms; Companies; Data mining; Data models; Mathematical model; Training; attribute selection; binary classification; customer targeting; data mining; feature selection; imbalanced data; variable selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining and Optimization (DMO), 2012 4th Conference on
  • Conference_Location
    Langkawi
  • Print_ISBN
    978-1-4673-2717-6
  • Type

    conf

  • DOI
    10.1109/DMO.2012.6329795
  • Filename
    6329795