DocumentCode :
2370022
Title :
Cost-sensitive learning by cost-proportionate example weighting
Author :
Zadrozny, Bianca ; Langford, John ; Abe, Naoki
Author_Institution :
Dept. of Math. Sci., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
435
Lastpage :
442
Abstract :
We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conversion is based on cost-proportionate weighting of the training examples, which can be realized either by feeding the weights to the classification algorithm (as often done in boosting), or by careful subsampling. We give some theoretical performance guarantees on the proposed methods, as well as empirical evidence that they are practical alternatives to existing approaches. In particular, we propose costing, a method based on cost-proportionate rejection sampling and ensemble aggregation, which achieves excellent predictive performance on two publicly available datasets, while drastically reducing the computation required by other methods.
Keywords :
cost-benefit analysis; learning (artificial intelligence); pattern classification; sampling methods; support vector machines; classification algorithm; cost-proportionate example weighting; cost-proportionate rejection sampling; cost-sensitive learning algorithms; support vector machines; Boosting; Classification algorithms; Computer crime; Costing; Costs; Data mining; High performance computing; Machine learning; Medical diagnosis; Sampling methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250950
Filename :
1250950
Link To Document :
بازگشت