DocumentCode :
1831493
Title :
Diversity of feature selection approaches combined with distinct classifiers
Author :
Li Feng-Chia ; Wang Peng-Kai ; Yeh Li-Lon
Author_Institution :
Dept. of Inf. Manage., Jen Teh Junior Coll., Miaoli, Taiwan
fYear :
2010
fDate :
7-10 Dec. 2010
Firstpage :
28
Lastpage :
32
Abstract :
The credit scoring has been regarded as a critical topic and its related departments make efforts to collect huge amount of data to avoid wrong decision. An effective classificatory model will objectively help managers instead of intuitive experience. This study proposes five approaches combining with the back-propagation neural network (BPN) classifier for features selection that retains sufficient information for classification purpose. Different credit scoring models are constructed by selecting attributes with five approaches. Two UCI (University of California, Irvine) data sets are chosen to evaluate the accuracy of various hybrid-BPN models. BPN classifier combines with conventional statistical LDA, Decision tree, Rough sets theory, F-score and Gray relation approaches as features preprocessing step to optimize feature space by removing both irrelevant and redundant features. In this paper, the procedure of the proposed approaches will be described and then evaluated by their performances. The results are compared in combination with BPN classifier and nonparametric Wilcoxon signed rank test will be held to show if there is any significant difference between these models. The result in this study suggests that hybrid credit scoring approach is mostly robust and effective in finding optimal subsets and is a promising method to the fields of data mining.
Keywords :
backpropagation; decision trees; neural nets; pattern classification; rough set theory; statistical analysis; F-score; backpropagation neural network classifier; decision tree; distinct classifiers; feature selection approaches; gray relation approaches; nonparametric Wilcoxon signed rank test; rough sets theory; statistical LDA; Accuracy; Classification algorithms; Computational modeling; Data mining; Data models; Decision trees; Rough sets; Back-propagation neural network; Decision tree; F-score; Gray relational analysis; Linear discriminate analysis; Rough sets theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Engineering and Engineering Management (IEEM), 2010 IEEE International Conference on
Conference_Location :
Macao
ISSN :
2157-3611
Print_ISBN :
978-1-4244-8501-7
Electronic_ISBN :
2157-3611
Type :
conf
DOI :
10.1109/IEEM.2010.5674600
Filename :
5674600
Link To Document :
بازگشت