DocumentCode :
2600857
Title :
RMS bounds and sample size considerations for error estimation in linear discriminant analysis
Author :
Zollanvari, Amin ; Braga-Neto, Ulisses M. ; Dougherty, Edward R.
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas A&M Univ., College Station, TX, USA
fYear :
2010
fDate :
10-12 Nov. 2010
Firstpage :
1
Lastpage :
4
Abstract :
The validity of a classifier depends on the precision of the error estimator used to estimate its true error. This paper considers the necessary sample size to achieve a given validity measure, namely RMS, for resubstitution and leave-one-out error estimators in the context of LDA. It provides bounds for the RMS between the true error and both the resubstitution and leave-one-out error estimators in terms of sample size and dimensionality. These bounds can be used to determine the minimum sample size in order to obtain a desired estimation accuracy, relative to RMS. To show how these results can be used in practice, a microarray classification problem is presented.
Keywords :
biological techniques; biology computing; error statistics; pattern classification; statistical analysis; RMS bounds; classifier validity; error estimator precision; estimation accuracy; leave one out error estimators; linear discriminant analysis; microarray classification problem; minimum sample size determination; resubstitution error estimators; root mean square bounds; sample size considerations; true error estimation; Approximation methods; Bioinformatics; Context; Covariance matrix; Error analysis; Estimation; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Genomic Signal Processing and Statistics (GENSIPS), 2010 IEEE International Workshop on
Conference_Location :
Cold Spring Harbor, NY
ISSN :
2150-3001
Print_ISBN :
978-1-61284-791-7
Type :
conf
DOI :
10.1109/GENSIPS.2010.5719691
Filename :
5719691
Link To Document :
بازگشت