Title of article :
Investigating the missing data effect on credit scoring rule based models: The case of an Iranian bank
Author/Authors :
Sadatrasoul ، Mahdi - Kharazmi University , Hajimohammadi ، Zeynab Shahid Beheshti University
Pages :
12
From page :
1
To page :
12
Abstract :
Credit risk management is a process in which banks estimate probability of default (PD) for each loan applicant. Data sets of previous loan applicants are built by gathering their data, and these internal data sets are usually completed using external credit bureau’s data and finally used for estimating PD in banks. There is also a continuous interest for bank to use rule based classifiers to build their default prediction models. However, in practice the data records are usually incomplete and have some missing values and this make problems for banks, especially in credit risk portfolios which are low default and makes model rule based building complex. Several strategies could be used in order to handle the missing data issue. This paper used five missing value handling strategies including; ignoring, replacing with random, mean, C R tree induced values and elimination strategies in a real credit scoring dataset. Experimental results show that ignoring strategy consistently outperforms other methods on test data set, and suggest that the CHAID is a useful classifier for handling low default portfolios with missing value.
Keywords :
Credit Scoring , Banking Industry , Rule extraction , Missing data , Low default portfolio
Journal title :
Journal of Industrial Engineering and Management Studies
Serial Year :
2018
Journal title :
Journal of Industrial Engineering and Management Studies
Record number :
2462457
Link To Document :
بازگشت