• DocumentCode
    2600488
  • Title

    Approximate expressions for the variances of non-randomized error estimators and CoD estimators for the discrete histogram rule

  • Author

    Chen, Ting ; Braga-Neto, Ulisses

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Texas A&M Univ., College Station, TX, USA
  • fYear
    2010
  • fDate
    10-12 Nov. 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Estimation of the classification error and of the coefficient of determination (CoD) is a fundamental issue in discrete prediction problems. Analytical expressions for exact performance metrics of non-randomized error estimators and CoD estimators have been derived in previous publications by the authors. However, computation of these expressions becomes problematic as the sample size or predictor complexity increases, particularly in the case of second moments. Thus, fast and accurate approximations are desirable. In this paper, we make approximations to the variances of resubstitution and leave-one-out error estimators and CoD estimators. Our results show that these approximations not only are quite accurate but also reduce computation time tremendously.
  • Keywords
    bioinformatics; error analysis; genetics; prediction theory; CoD estimators; classification error; coefficient of determination; computation time; discrete histogram rule; discrete prediction problem; leave-one-out error estimators; nonrandomized error estimators; resubstitution; variances; Approximation methods; Complexity theory; Error analysis; Estimation; Histograms; Joints; Tin; CoD estimation; Discrete prediction; discrete histogram rule; error estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics (GENSIPS), 2010 IEEE International Workshop on
  • Conference_Location
    Cold Spring Harbor, NY
  • ISSN
    2150-3001
  • Print_ISBN
    978-1-61284-791-7
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2010.5719673
  • Filename
    5719673