Title :
A comparison of data mining techniques in evaluating retail credit scoring using R programming
Author :
Dilmurat Zakirov;Aleksey Bondarev;Nodar Momtselidze
Author_Institution :
Information Technologies Department, Demir Kyrgyz International Bank, Bishkek, Kyrgyzstan
Abstract :
Retail credit scoring has become more efficient in recent years because of the use of data mining techniques that allow marketing officers and top managers to better estimate their customers credibility. In recent years, many complicated models have been developed; however there are few of them which continues to be used because of its efficiency and simplicity. This study investigates k-Nearest Neighbourhood (kNN), support vector machines (SVMs), gradient boosted model (GBM), Naive Bayes classification, Classification and Regression Tree (CART) and Random Forest (RF) as analytical methods for customer credit scoring estimation and evaluation, using real dataset. At the end of the study it is found that Random Forest model with down-sampling (RF_US) has better accuracy rate when compared to other models.
Keywords :
"Data mining","Support vector machines","Vegetation","Boosting","Computational modeling","Predictive models","Regression tree analysis"
Conference_Titel :
Electronics Computer and Computation (ICECCO), 2015 Twelve International Conference on
DOI :
10.1109/ICECCO.2015.7416867