DocumentCode
1007196
Title
The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier
Author
Veenman, Cor J. ; Reinders, Marcel J T
Author_Institution
Dept. of Mediamatics, Delft Univ. of Technol., Netherlands
Volume
27
Issue
9
fYear
2005
Firstpage
1417
Lastpage
1429
Abstract
We present the nearest subclass classifier (NSC), which is a classification algorithm that unifies the flexibility of the nearest neighbor classifier with the robustness of the nearest mean classifier. The algorithm is based on the maximum variance cluster algorithm and, as such, it belongs to the class of prototype-based classifiers. The variance constraint parameter of the cluster algorithm serves to regularize the classifier, that is, to prevent overfitting. With a low variance constraint value, the classifier turns into the nearest neighbor classifier and, with a high variance parameter, it becomes the nearest mean classifier with the respective properties. In other words, the number of prototypes ranges from the whole training set to only one per class. In the experiments, we compared the NSC with regard to its performance and data set compression ratio to several other prototype-based methods. On several data sets, the NSC performed similarly to the k-nearest neighbor classifier, which is a well-established classifier in many domains. Also concerning storage requirements and classification speed, the NSC has favorable properties, so it gives a good compromise between classification performance and efficiency.
Keywords
data compression; pattern classification; pattern clustering; data set compression ratio; maximum variance cluster algorithm; nearest mean classifier; nearest neighbor classifier; nearest subclass classifier; prototype-based classifiers; Classification algorithms; Clustering algorithms; Gaussian distribution; Nearest neighbor searches; Predictive models; Protocols; Prototypes; Robustness; Stress; Training data; Index Terms- Classification; cross-validation; prototype selection.; regularization; Algorithms; Artificial Intelligence; Cluster Analysis; Models, Statistical; Pattern Recognition, Automated;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/TPAMI.2005.187
Filename
1471707
Link To Document