DocumentCode :
1116058
Title :
On Dimensionality, Sample Size, Classification Error, and Complexity of Classification Algorithm in Pattern Recognition
Author :
Raudys, Sarunas ; Pikelis, Vitalijus
Author_Institution :
Lietuvos RSR Moksly, Adademiha, Lenino, U.S.S.R.
Issue :
3
fYear :
1980
fDate :
5/1/1980 12:00:00 AM
Firstpage :
242
Lastpage :
252
Abstract :
This paper compares four classification algorithms-discriminant functions when classifying individuals into two multivariate populations. The discriminant functions (DF´s) compared are derived according to the Bayes rule for normal populations and differ in assumptions on the covariance matrices´ structure. Analytical formulas for the expected probability of misclassification EPN are derived and show that the classification error EPN depends on the structure of a classification algorithm, asymptotic probability of misclassification P¿, and the ratio of learning sample size N to dimensionality p:N/p for all linear DF´s discussed and N2/p for quadratic DF´s. The tables for learning quantity H = EPN/P¿ depending on parameters P¿, N, and p for four classifilcation algorithms analyzed are presented and may be used for estimating the necessary learning sample size, detennining the optimal number of features, and choosing the type of the classification algorithm in the case of a limited learning sample size.
Keywords :
Algorithm design and analysis; Classification algorithms; Covariance matrix; Error correction; Euclidean distance; Lead; Measurement standards; Pattern analysis; Pattern recognition; Vectors; Classification error; dimensionality; discriminant functions; pattern recognition; sample size;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.1980.4767011
Filename :
4767011
Link To Document :
بازگشت