DocumentCode :
2854896
Title :
Features selection approaches combined with effective classifiers in credit scoring
Author :
Lin, Chia-Ching ; Chang, Chin-Chih ; Li, Feng-Chia ; Chao, Tzu-Chin
Author_Institution :
Dept. of Appl. English, Yu Da Univ., Miaoli, Taiwan
fYear :
2011
fDate :
6-9 Dec. 2011
Firstpage :
752
Lastpage :
757
Abstract :
With the rapid growth of the credit industry, credit scoring models are widely used for credit admission evaluation. Credit scoring is regarded as a critical topic, and the departments involved collect large amounts of data to avoid making wrong decisions. Finding effective classification models is important because they help managers make objective decisions instead of relying merely on intuition and experience. This study proposes three feature selection approaches, each combined with two well-known classifiers, namely K-Nearest Neighbor (KNN) and Support Vector Machine (SVM), to find the best hybrid classifier combination. Feature selection retains sufficient information for classification purposes. Different credit scoring combinations are constructed by pairing the three feature selection approaches with the two classifiers. Two credit data sets from the University of California, Irvine (UCI) are chosen to evaluate the accuracy of the various hybrid feature selection models. The KNN and SVM classifiers are combined with linear discriminant analysis (LDA), rough sets theory (RST), and F-score approaches as a feature preprocessing step that optimizes the feature space by removing both irrelevant and redundant features. In this paper, the procedures of the proposed approaches are described and their performance is evaluated. The results are compared, and a nonparametric test is performed to show whether there is any significant difference between the models. The F-score approach combined with the classifiers performs best on both data sets. The results of this study suggest that the hybrid credit scoring approach is mostly robust and effective in finding optimal feature subsets and is a promising method in the field of data mining.
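The abstract does not define the F-score it uses. As a minimal sketch, assuming it refers to the commonly used Fisher-style F-score for binary classes (between-class separation of a feature divided by the sum of its within-class sample variances), feature ranking and selection could look like the following. The function names `f_scores` and `select_top_k` and the toy data are hypothetical illustrations, not taken from the paper.

```python
from statistics import mean, variance


def f_scores(X, y):
    """Per-feature F-score for binary labels (1 = positive, 0 = negative).

    Assumed definition: squared distances of the class means from the
    overall mean, divided by the sum of the per-class sample variances.
    """
    pos = [row for row, label in zip(X, y) if label == 1]
    neg = [row for row, label in zip(X, y) if label == 0]
    scores = []
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        col_pos = [row[j] for row in pos]
        col_neg = [row[j] for row in neg]
        between = (mean(col_pos) - mean(col)) ** 2 + (mean(col_neg) - mean(col)) ** 2
        # statistics.variance uses the (n - 1) denominator
        within = variance(col_pos) + variance(col_neg)
        scores.append(between / within if within > 0 else 0.0)
    return scores


def select_top_k(X, y, k):
    """Return the column indices of the k highest-scoring features."""
    scores = f_scores(X, y)
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


# Toy data: feature 0 separates the classes well, feature 1 does not.
X = [[1.0, 5.0], [1.2, 3.0], [3.0, 4.0], [3.2, 6.0]]
y = [0, 0, 1, 1]
scores = f_scores(X, y)
selected = select_top_k(X, y, 1)
```

The selected columns would then be passed to a classifier such as KNN or SVM; how the paper chooses the number of retained features is not stated in the abstract.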
Keywords :
finance; pattern classification; statistical analysis; support vector machines; Features selection approaches; KNN; SVM; credit admission evaluation; credit industry; credit scoring; k-nearest neighbor; linear discriminate analysis; rough sets; support vector machine; Accuracy; Data mining; Data models; Face; Support vector machines; Training; Training data; F-score; KNN; LDA; RST; SVM;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Industrial Engineering and Engineering Management (IEEM), 2011 IEEE International Conference on
Conference_Location :
Singapore
ISSN :
2157-3611
Print_ISBN :
978-1-4577-0740-7
Electronic_ISBN :
2157-3611
Type :
conf
DOI :
10.1109/IEEM.2011.6118017
Filename :
6118017