DocumentCode
1733493
Title
A feature selection model for binary classification of imbalanced data based on preference for target instances
Author
Tan, Ding-Wen ; Liew, Soung-Yue ; Tan, Teik-Boon ; Yeoh, William
Author_Institution
Univ. Tunku Abdul Rahman, Kampar, Malaysia
fYear
2012
Firstpage
35
Lastpage
42
Abstract
Telemarketers of online job advertising firms face significant challenges understanding the advertising demands of small-sized enterprises. The effective use of data mining approach can offer e-recruitment companies an improved understanding of customers´ patterns and greater insights of purchasing trends. However, prior studies on classifier built by data mining approach provided limited insights into the customer targeting problem of job advertising companies. In this paper we develop a single feature evaluator and propose an approach to select a desired feature subset by setting a threshold. The proposed feature evaluator demonstrates its stability and outstanding performance through empirical experiments in which real-world customer data of an e-recruitment firm are used. Practically, the findings together with the model may help telemarketers to better understand their customers. Theoretically, this paper extends existing research on feature selection for binary classification of imbalanced data.
Keywords
advertising; consumer behaviour; customer profiles; data mining; feature extraction; pattern classification; recruitment; advertising demand; binary classification; customer data; customer pattern; customer targeting problem; data mining; e-recruitment company; feature evaluator; feature selection model; feature subset; imbalanced data; job advertising company; online job advertising firm; preference; purchasing trend; small-sized enterprise; target instances; telemarketers; Artificial neural networks; Classification algorithms; Companies; Data mining; Data models; Mathematical model; Training; attribute selection; binary classification; customer targeting; data mining; feature selection; imbalanced data; variable selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining and Optimization (DMO), 2012 4th Conference on
Conference_Location
Langkawi
Print_ISBN
978-1-4673-2717-6
Type
conf
DOI
10.1109/DMO.2012.6329795
Filename
6329795
Link To Document