DocumentCode
2198223
Title
Solving Cross-Selling Problems with Ensemble Learning: A Case Study
Author
Guo, Xinjian ; Yin, Yilong ; Zhou, Guangtong ; Dong, Cailing
Author_Institution
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan
fYear
2008
fDate
20-22 Dec. 2008
Firstpage
128
Lastpage
133
Abstract
This paper shows our solution to PAKDD Competition 2007 as a case study of cross-selling problems. Following a brief description of the data mining task, we discuss several difficulties to be confronted with in the task from the view of data mining. Then, we show how to do the data pre-processing. In the solution we proposed, to weaken class imbalance of the modeling dataset externally, we combine under-sampling and over-sampling techniques. Besides, we adjust the parameters of each base learner internally to solve cost-sensitivity. Next, we get an ensemble of base learners to achieve a better predicting performance. Experimental results on prediction dataset of real world provided by PAKDD Competition 2007 show that our solution is effective and efficient with its AUC value 60.73%.
Keywords
data mining; learning (artificial intelligence); marketing data processing; PAKDD Competition 2007; cost-sensitivity; cross-selling problems; data mining; ensemble learning; over-sampling; under-sampling; Advertising; Association rules; Companies; Computer science; Costs; Credit cards; Data mining; Data preprocessing; Loans and mortgages; Predictive models;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Computer Theory and Engineering, 2008. ICACTE '08. International Conference on
Conference_Location
Phuket
Print_ISBN
978-0-7695-3489-3
Type
conf
DOI
10.1109/ICACTE.2008.86
Filename
4736935
Link To Document