Title :
Cost-sensitive learning by cost-proportionate example weighting
Author :
Zadrozny, Bianca ; Langford, John ; Abe, Naoki
Author_Institution :
Dept. of Math. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conversion is based on cost-proportionate weighting of the training examples, which can be realized either by feeding the weights to the classification algorithm (as often done in boosting), or by careful subsampling. We give some theoretical performance guarantees on the proposed methods, as well as empirical evidence that they are practical alternatives to existing approaches. In particular, we propose costing, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.
Keywords :
cost-benefit analysis; learning (artificial intelligence); pattern classification; sampling methods; support vector machines; classification algorithm; cost-proportionate example weighting; cost-proportionate rejection sampling; cost-sensitive learning algorithms; support vector machines; Boosting; Classification algorithms; Computer crime; Costing; Costs; Data mining; High performance computing; Machine learning; Medical diagnosis; Sampling methods;
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
DOI :
10.1109/ICDM.2003.1250950