Title : 
A criterion for choosing between full-sample and hold-out classifier design
         
        
            Author : 
Brun, Marcel ; Xu, Qian ; Dougherty, Edward R.
         
        
            Author_Institution : 
Univ. Nac. de Mar del Plata, Mar del Plata
         
        
        
        
        
        
            Abstract : 
Is it better to design a classifier and estimate its error on the full sample or to design a classifier on a training subset and estimate its error on the hold-out test subset? Full-sample design provides the better classifier; nevertheless, one might choose hold-out with the hope of better error estimation. A conservative criterion to decide the best course is to aim at a classifier whose error is less than a given bound. Then the choice between full-sample and hold-out design depends on which possesses the smaller expected bound. Using this criterion, we examine the choice between hold-out and several full-sample error estimators using covariance models. The relation between the two designs is revealed via a decomposition of the expected bound into the sum of the expected true error and the expected conditional standard deviation of the true error.
         
        
            Keywords : 
covariance analysis; error analysis; genetics; medical computing; pattern classification; classifier error estimation; conservative criterion; covariance models; error bound decomposition; full sample classifier design; hold out classifier design; true error conditional standard deviation; Bioinformatics; Computational biology; Computer errors; Error analysis; Genomics; Process design; Sampling methods; Strontium; Testing; Tin;
         
        
        
        
            Conference_Titel : 
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
         
        
            Conference_Location : 
Phoenix, AZ
         
        
            Print_ISBN : 
978-1-4244-2371-2
         
        
            Electronic_ISBN : 
978-1-4244-2372-9
         
        
        
            DOI : 
10.1109/GENSIPS.2008.4555662