Title of article
Overestimation of the receiver operating characteristic curve for logistic regression
Author/Authors
Copas، J.B. نويسنده , , P.Corbett، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2002
Pages
-314
From page
315
To page
0
Abstract
Logistic regression is often used to find a linear combination of covariates which best discriminates between two groups or populations.The ROC, receiver operating characteristic, curve is a good way of assessing the performance of the resulting score, but using the same data both to fit the score and to calculate its ROC leads to an over-optimistic estimate of the performance which the score would give if it were to be validated on a sample of future cases. The paper studies the extent of this overestimation, and suggests a shrinkage correction for the ROC curve itself and for the area under the curve. The correction is consistent with Efronʹs formula for the bias in the error rate of a binary prediction rule. Two medical examples are discussed.
Keywords
Particle filter , importance sampling , Metropolis–Hastings , Mixture model , Batch importance sampling , Markov chain Monte Carlo , Parallel processing , Generalised linear model
Journal title
Biometrika
Serial Year
2002
Journal title
Biometrika
Record number
71806
Link To Document