• DocumentCode
    3128346
  • Title

    Domain-Specific Adaptation of a Partial Least Squares Regression Model for Loan Defaults Prediction

  • Author

    Srinivasan, Balaji Vasan ; Gnanasambandam, Nathan ; Zhao, Shi ; Minhas, Raj

  • Author_Institution
    Inst. for Adv. Comput. Studies, Univ. of Maryland, College Park, MD, USA
  • fYear
    2011
  • fDate
    11-11 Dec. 2011
  • Firstpage
    474
  • Lastpage
    479
  • Abstract
    Loan management agencies monitor several loan related attributes for tracking the condition and quality of their financial portfolios. If the trend of loan related status is understood well, the agency would be able to proactively take actions to avoid prolonged delinquency and loan defaults. If an early warning system is available to predict the risk with a loan well-ahead of time, the agency can potentially take corrective measures to prevent the loan from defaulting. In this paper, we use a partial least squares (PLS) regression to model the status of a loan quantized to a non-linear scale of 0 to 100 (where the severity function is built with inputs from domain experts). We use the associated "Variable Influence on Projection" or VIP scores to select the useful variables for better prediction. In order to address the imbalance in the categories of the observed records (typically the number of low risk records are much more than the risky records), we propose a multi-PLS model for loan prediction. We further enhance the model outputs based on certain domain- specific indicator variables. The resulting model shows improved predictive capacity against a direct application of the PLS model.
  • Keywords
    investment; least squares approximations; regression analysis; VIP scores; domain- specific indicator variables; domain-specific adaptation; early warning system; financial portfolios; loan defaults prediction; loan management agencies; multiPLS model; partial least squares regression model; variable influence on projection; Accuracy; Computational modeling; Data models; Input variables; Monitoring; Predictive models; Vectors; indicator variable based boosting; loan defaults prediction; partial least squares; variable influence on projection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver, BC
  • Print_ISBN
    978-1-4673-0005-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2011.69
  • Filename
    6137417