DocumentCode :
3264768
Title :
Variable Precision Neighborhood Rough Set Based Feature Selection for Credit Scoring
Author :
Yao, Ping
Author_Institution :
Coll. of Econ. & Manage., Heilongjiang Inst. of Sci. & Technol., Harbin, China
Volume :
2
fYear :
2009
fDate :
6-7 June 2009
Firstpage :
63
Lastpage :
66
Abstract :
As the credit industry has been growing rapidly, huge number of consumerspsila credit data are collected by the credit department of the bank and credit scoring has become a very important issue. Usually, a large amount of redundant information and features are involved in the credit dataset, which leads to lower accuracy and higher complexity of the credit scoring model, so, effective feature selection methods are necessary for credit dataset with huge number of features. This paper aims at comparing seven well-known feature selection methods for credit scoring. Which are t-test, principle component analysis (PCA), factor analysis (FA), stepwise regression, rough set (RS), classification and regression tree (CART) and multivariate adaptive regression splines (MARS). Support vector machine (SVM) is used as the classification model. Two credit scoring databases are used in order to provide a reliable conclusion. Regarding the experimental results, the CART and MARS methods outperform the other methods by the overall accuracy and type I error and type II error.
Keywords :
finance; principal component analysis; regression analysis; rough set theory; splines (mathematics); support vector machines; trees (mathematics); classification and regression tree; credit scoring; factor analysis; feature selection methods; multivariate adaptive regression splines; principle component analysis; stepwise regression; support vector machine; variable precision neighborhood rough set; Computational intelligence; Computer industry; Conference management; Educational institutions; Entropy; Fuzzy sets; Industrial economics; Regression tree analysis; Set theory; Technology management; classification and regression tree; credit scoring; feature selection; fuzzy entropy; support vector machine; variable precision neighorhood rough set;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Natural Computing, 2009. CINC '09. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3645-3
Type :
conf
DOI :
10.1109/CINC.2009.35
Filename :
5231034
Link To Document :
بازگشت