DocumentCode :
1733493
Title :
A feature selection model for binary classification of imbalanced data based on preference for target instances
Author :
Tan, Ding-Wen ; Liew, Soung-Yue ; Tan, Teik-Boon ; Yeoh, William
Author_Institution :
Univ. Tunku Abdul Rahman, Kampar, Malaysia
fYear :
2012
Firstpage :
35
Lastpage :
42
Abstract :
Telemarketers of online job advertising firms face significant challenges understanding the advertising demands of small-sized enterprises. The effective use of data mining approach can offer e-recruitment companies an improved understanding of customers´ patterns and greater insights of purchasing trends. However, prior studies on classifier built by data mining approach provided limited insights into the customer targeting problem of job advertising companies. In this paper we develop a single feature evaluator and propose an approach to select a desired feature subset by setting a threshold. The proposed feature evaluator demonstrates its stability and outstanding performance through empirical experiments in which real-world customer data of an e-recruitment firm are used. Practically, the findings together with the model may help telemarketers to better understand their customers. Theoretically, this paper extends existing research on feature selection for binary classification of imbalanced data.
Keywords :
advertising; consumer behaviour; customer profiles; data mining; feature extraction; pattern classification; recruitment; advertising demand; binary classification; customer data; customer pattern; customer targeting problem; data mining; e-recruitment company; feature evaluator; feature selection model; feature subset; imbalanced data; job advertising company; online job advertising firm; preference; purchasing trend; small-sized enterprise; target instances; telemarketers; Artificial neural networks; Classification algorithms; Companies; Data mining; Data models; Mathematical model; Training; attribute selection; binary classification; customer targeting; data mining; feature selection; imbalanced data; variable selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining and Optimization (DMO), 2012 4th Conference on
Conference_Location :
Langkawi
Print_ISBN :
978-1-4673-2717-6
Type :
conf
DOI :
10.1109/DMO.2012.6329795
Filename :
6329795
Link To Document :
بازگشت