• DocumentCode
    423700
  • Title

    Using permutations instead of student´s t distribution for p-values in paired-difference algorithm comparisons

  • Author

    Menke, Joshua ; Martinez, Tony R.

  • Author_Institution
    Dept. of Comput. Sci., Brigham Young Univ., Provo, UT, USA
  • Volume
    2
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    1331
  • Abstract
    The paired-difference t-test is commonly used in the machine learning community to determine whether one learning algorithm is better than another on a given learning task. This paper suggests the use of the permutation test instead because it calculates the exact p-value instead of an estimate. The permutation test is also distribution free and the time complexity is trivial for the commonly used 10-fold cross-validation paired-difference test. Results of experiments on real-world problems suggest it is not uncommon to see the t-test estimate deviate up to 30-50% from the exact p-value.
  • Keywords
    computational complexity; learning (artificial intelligence); statistical testing; 10 fold cross validation paired difference t-test; machine learning algorithm; p-values permutation test; time complexity; Computer science; Machine learning; Machine learning algorithms; Packaging; Probability; Robustness; Statistical analysis; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1380138
  • Filename
    1380138