Title :
RMS bounds and sample size considerations for error estimation in linear discriminant analysis
Author :
Zollanvari, Amin ; Braga-Neto, Ulisses M. ; Dougherty, Edward R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas A&M Univ., College Station, TX, USA
Abstract :
The validity of a classifier depends on the precision of the error estimator used to estimate its true error. This paper considers the necessary sample size to achieve a given validity measure, namely RMS, for resubstitution and leave-one-out error estimators in the context of LDA. It provides bounds for the RMS between the true error and both the resubstitution and leave-one-out error estimators in terms of sample size and dimensionality. These bounds can be used to determine the minimum sample size in order to obtain a desired estimation accuracy, relative to RMS. To show how these results can be used in practice, a microarray classification problem is presented.
Keywords :
biological techniques; biology computing; error statistics; pattern classification; statistical analysis; RMS bounds; classifier validity; error estimator precision; estimation accuracy; leave one out error estimators; linear discriminant analysis; microarray classification problem; minimum sample size determination; resubstitution error estimators; root mean square bounds; sample size considerations; true error estimation; Approximation methods; Bioinformatics; Context; Covariance matrix; Error analysis; Estimation; Training;
Conference_Titel :
Genomic Signal Processing and Statistics (GENSIPS), 2010 IEEE International Workshop on
Conference_Location :
Cold Spring Harbor, NY
Print_ISBN :
978-1-61284-791-7
DOI :
10.1109/GENSIPS.2010.5719691