Title :
Classification Bias of the k-Nearest Neighbor Algorithm
Author_Institution :
Geometric Data, 999 West Valley Rd., Wayne PA 19087.
Date :
5/1/1984
Abstract :
The k-nearest neighbor classifier has been used extensively in pattern analysis applications. This classifier can, however, have substantial bias when there is little class separation and the sample sizes are unequal. This classification bias is examined for the two-class situation, and formulas are presented that allow selection of values of k that yield minimum bias.
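The bias described in the abstract can be illustrated in the limiting case of no class separation at all. The following is a minimal sketch under stated assumptions, not the formulas presented in the paper: it assumes both classes have identical distributions, so the k nearest neighbors of a test point behave like a random subset of the pooled training sample, and it assumes an odd k with simple majority voting. The function name `prob_vote_for_class1` and the sample sizes are hypothetical.

```python
from math import comb


def prob_vote_for_class1(n1: int, n2: int, k: int) -> float:
    """Probability that a majority of the k nearest neighbors belong to class 1.

    Assumption (illustrative only): the two classes are identically
    distributed, so the k neighbors are a uniformly random subset of the
    n1 + n2 pooled training points (a hypergeometric draw).
    """
    if k % 2 == 0:
        raise ValueError("k must be odd so that a majority vote cannot tie")
    if k > n1 + n2:
        raise ValueError("k cannot exceed the pooled sample size")
    total = comb(n1 + n2, k)
    # Count the subsets of size k in which class 1 holds more than half the votes.
    wins = sum(comb(n1, j) * comb(n2, k - j) for j in range(k // 2 + 1, k + 1))
    return wins / total


# Hypothetical unequal sample sizes: 30 points in class 1, 10 in class 2.
for k in (1, 3, 5, 7):
    print(k, round(prob_vote_for_class1(30, 10, k), 4))
```

With equal sample sizes the probability is 0.5 for every odd k. With unequal sizes it starts at the larger class's share of the pooled sample for k = 1 and moves further toward that class as k grows, which is the kind of bias the abstract refers to.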
Keywords :
Breast cancer; Cancer detection; Density functional theory; Distribution functions; Error analysis; Frequency estimation; Nearest neighbor searches; Pattern analysis; Performance analysis; Yield estimation; Analysis of class separation; error estimation; k-nearest neighbor classifier; nonparametric discrimination
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.1984.4767533