Regional Information Center for Science and Technology - Investigating the missing data effect on credit scoring rule based models: The case of an Iranian bank

Title of article :

Investigating the missing data effect on credit scoring rule based models: The case of an Iranian bank

Author/Authors :

Sadatrasoul, Mahdi - Kharazmi University; Hajimohammadi, Zeynab - Shahid Beheshti University

Abstract :

Credit risk management is a process in which banks estimate the probability of default (PD) for each loan applicant. Banks build data sets from the records of previous loan applicants, usually complete these internal data sets with external credit bureau data, and then use them to estimate PD. Banks also have a continuing interest in using rule-based classifiers to build their default prediction models. In practice, however, data records are usually incomplete and contain missing values. This causes problems for banks, especially in low-default credit risk portfolios, and makes rule-based model building complex. Several strategies can be used to handle missing data. This paper applies five missing-value handling strategies to a real credit scoring data set: ignoring, replacing with random values, replacing with the mean, replacing with C&R tree induced values, and elimination. Experimental results show that the ignoring strategy consistently outperforms the other methods on the test data set, and suggest that CHAID is a useful classifier for handling low-default portfolios with missing values.
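
Three of the strategies named in the abstract (elimination, mean replacement, and random replacement) can be illustrated with a minimal sketch. This is not the authors' implementation. The column names, the toy records, the use of `None` to mark a missing value, and the choice to draw random replacements from the observed values of the same column are all assumptions made for illustration. The ignoring strategy leaves the data unchanged, and the C&R tree strategy needs a fitted tree model, so neither is shown.

```python
import random
from statistics import mean

MISSING = None  # assumption: a missing value is encoded as None


def eliminate(rows):
    """Elimination: drop every record that has at least one missing value."""
    return [row for row in rows if all(v is not MISSING for v in row.values())]


def replace_with_mean(rows, column):
    """Mean replacement: fill missing entries with the mean of the observed values."""
    observed = [row[column] for row in rows if row[column] is not MISSING]
    fill = mean(observed)
    return [
        {**row, column: fill if row[column] is MISSING else row[column]}
        for row in rows
    ]


def replace_with_random(rows, column, seed=0):
    """Random replacement: fill missing entries with a randomly chosen observed value.

    Assumption: the replacement is drawn from values seen in the same column.
    """
    rng = random.Random(seed)
    observed = [row[column] for row in rows if row[column] is not MISSING]
    return [
        {**row, column: rng.choice(observed) if row[column] is MISSING else row[column]}
        for row in rows
    ]


# Hypothetical loan applicant records, not taken from the paper's data set.
applicants = [
    {"income": 40.0, "age": 30},
    {"income": None, "age": 45},
    {"income": 60.0, "age": None},
]

complete = eliminate(applicants)
mean_filled = replace_with_mean(applicants, "income")
random_filled = replace_with_random(applicants, "income")
```

Each function returns a new list and leaves `applicants` untouched, so the same raw records can be passed through every strategy and the resulting data sets compared on equal terms.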

Keywords :

Credit Scoring , Banking Industry , Rule extraction , Missing data , Low default portfolio

Journal title :

Journal of Industrial Engineering and Management Studies

Serial Year :

2018

Record number :

2462457

Link To Document :

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=2462457