• DocumentCode
    259682
  • Title

    OUPS: A Combined Approach Using SMOTE and Propensity Score Matching

  • Author

    Rivera, William A. ; Goel, Amit ; Kincaid, J. Peter

  • Author_Institution
    Inst. for Simulation Training, Univ. of Central Florida, Orlando, FL, USA
  • fYear
    2014
  • fDate
    3-6 Dec. 2014
  • Firstpage
    424
  • Lastpage
    427
  • Abstract
    Building accurate classifiers is difficult when using data that is skewed or imbalanced which is typical of real world data sets. Two popular approaches that have been applied for improving classification accuracy and statistical comparisons of imbalanced data sets are: synthetic minority over-sampling technique (SMOTE) and propensity score matching (PSM). A novel sampling approach is introduced referred to as over-sampling using propensity scores (OUPS) that blends the two and is simple and easy to perform resulting in improvement in accuracy and sensitivity over both SMOTE and PSM. The performance of our proposed approach is assessed using a simulation experiment and several performance metrics are shown where this approach fares and falls in comparison to the others.
  • Keywords
    pattern classification; statistical analysis; OUPS; PSM; SMOTE; classification accuracy; novel sampling approach; over-sampling using propensity scores; propensity score matching; real world data sets; statistical comparisons; synthetic minority over-sampling technique; Accuracy; Data models; Equations; Machine learning algorithms; Sensitivity; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2014 13th International Conference on
  • Conference_Location
    Detroit, MI
  • Type

    conf

  • DOI
    10.1109/ICMLA.2014.106
  • Filename
    7033153