DocumentCode :
384285
Title :
On machine learning, ROC analysis, and statistical tests of significance
Author :
Maloof, Marcus A.
Author_Institution :
Dept. of Comput. Sci., Georgetown Univ., Washington, DC, USA
Volume :
2
fYear :
2002
fDate :
2002
Firstpage :
204
Abstract :
Receiver operating characteristic (ROC) analysis is being used with greater frequency as an evaluation methodology in machine learning and pattern recognition. Researchers have used ANOVA to determine if the results from such analysis are statistically significant. Yet, in the medical decision making community, the prevailing method is LABMRMC. Although this latter method uses ANOVA, before doing so, it applies the Jackknife method to account for case-sample variance. To determine whether these two tests make the same decisions regarding statistical significance, we conducted a Monte Carlo simulation using several problems derived from Gaussian distributions, three machine-learning algorithms, ROC analysis, ANOVA, and LABMRMC. Results suggest that the decisions these tests make are not the same, even for simple problems. Furthermore, the larger issue is that since ANOVA does not account for case-sample variance, one cannot generalize experimental results to the population from which the data were drawn.
Keywords :
Gaussian distribution; Monte Carlo methods; decision making; learning (artificial intelligence); medical expert systems; sensitivity analysis; ANOVA; Gaussian distributions; Jackknife method; LABMRMC; Monte Carlo simulation; ROC analysis; case-sample variance; machine learning; machine-learning algorithms; medical decision making community; pattern recognition; receiver operating characteristic analysis; statistical significance; statistical tests of significance; Algorithm design and analysis; Analysis of variance; Computer science; Decision making; Humans; Learning systems; Machine learning; Monte Carlo methods; Performance analysis; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN :
1051-4651
Print_ISBN :
0-7695-1695-X
Type :
conf
DOI :
10.1109/ICPR.2002.1048273
Filename :
1048273
Link To Document :
بازگشت