DocumentCode :
2864997
Title :
Ranking-based evaluation of regression models
Author :
Rosset, Saharon ; Perlich, Claudia ; Zadrozny, Bianca
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2005
fDate :
27-30 Nov. 2005
Abstract :
We suggest the use of ranking-based evaluation measures for regression models, as a complement to the commonly used residual-based evaluation. We argue that in some cases, such as the case study we present, ranking can be the main underlying goal in building a regression model, and ranking performance is the correct evaluation metric. However, even when ranking is not the contextually correct performance metric, the measures we explore still have significant advantages: They are robust against extreme outliers in the evaluation set; and they are interpretable. The two measures we consider correspond closely to non-parametric correlation coefficients commonly used in data analysis (Spearman\´s p and Kendall\´s r); and they both have interesting graphical representations, which, similarly to ROC curves, offer useful "partial" model performance views, in addition to a one-number summary in the area under the curve. We illustrate our methods on a case study of evaluating IT wallet size estimation models for IBM\´s customers.
Keywords :
correlation methods; data analysis; regression analysis; data analysis; evaluation metric; graphical representation; model performance views; nonparametric correlation coefficients; ranking performance; ranking-based evaluation; regression model; residual-based evaluation; Area measurement; Costs; Data analysis; Data mining; Marketing and sales; Pediatrics; Predictive models; Robustness; Statistics; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, Fifth IEEE International Conference on
ISSN :
1550-4786
Print_ISBN :
0-7695-2278-5
Type :
conf
DOI :
10.1109/ICDM.2005.126
Filename :
1565701
Link To Document :
بازگشت